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fr-  This  experiment  investigated  the  relationships  between  speed,  level, 
and  complexity  in  individual  differences  in  spatial  ability.  Thirty  male 
high  school  and  college  students  representing  a  wide  range  of  individual 
differences  in  verbal  ability  and  spatial  ability  were  required  to  match, 
rotate,  combine,  and  combine  and  then  rotate  three  to  eight  point  polygons 
Three  levels  of  construction  (one,  two,  or  three  stimuli),  three  levels  of 
stimulus  complexity  (low,  medium  or  high),  three  levels  of  rotation  [> 
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(0,  90,  or  ISO  degrees),  and  two  types  of  discr Inina tive  response 
(correct  or  incorrect)  were  fully  crossed  in  the  within  subjects  design. 
Additionally,  type  of  addition  (left  or  right)  and  complexity  of  the 
to-be-constructed  stimulus  (low  or  high)  were  nested  within  levels  of 
the  construction  facet  but  crossed  with  each  other  and  all  other  facets. 

Correctness,  confidence,  tocal  time,  and  from  two  to  four  component 
tines  were  obtained  on  each  item.  Data  were  analyzed  using  regression, 
analysis  of  variance,  and  correlational  techniques. 

In  the  regression  analyses,  latency  measures  were  regressed  on 
percent  of  Che  sample  failing  each  item.  Correlations  between  intercepts, 
slopes,  predicted  times  at  maximum  difficulty,  and  level  scores  anu 
reference  constructs  supported  the  basic  hypothesis  of  speed-level 
independence,  In  particular,  speed  of  solving  simple  items  was  independent 
of  both  level  on  the  task  and  reference  abilities.  Speed  of  solving 
complex  items  was  strongly  correlated  with  level  on  the  task,  but  more 
highly  correlated  with  verbal  chan  spatial  reference  tests. 

The  major  results  of  the  means  analyses  were:  a)  there  were  large 
facet  interactions  in  all  analyses,  implying  chat  complexity  increased 
nonlinearly;  b)  encoding  time  increased  linearly  with  increases  in 
stimulus  complexity;  c)  rotation  time  and  errors  increased  almost  linearly 
over  Che  three  levels  of  rocation;  when  errors  and  lacency  were  combined, 
Che  increase  was  perfectly  linear;  d)  rotation  was  unaffected  by  stimulus 
complexity,  but  slower  for  constructed  chan  nonconstructed  stimuli; 

e)  construction  was  differentially  affected  by  the  number  of  stimuli  to 

be  combined,  the  complexity  of  these  stimuli,  the  location  of  the  addition, 
and,  most  significantly,  Che  complexity  of  the  to-be-constructed  scimulus; 

f)  for  match  times  "yes"  responses  were  faster  on  correct  items  and  "no" 
responses  faster  on  incorrect  items.  Those  who  performed  well  on  Che 
experimental  task  showd  little  increase  in  match  time  over  the  rotation 
and  construction  facets,  while  those  who  performed  poorly  showed  a  linear 
increase  in  match  time  over  the  rotation  facet  and  a  large  increase  on 
construction  items. 

Relationships  between  individual  differences  in  total  scores,  cell 
totals,  and  the  pattern  of  cell  means  over  design  facets  were  explored 
in  Che  correlational  analyses.  In  general,  correlations  between  correct¬ 
ness  indices  and  reference  tests  supported  the  construct  validity  of  the 
task  and  its  facets,  while  correlations  between  lacency  indices  and 
reference  tests  suggested  more  about  the  processes  or  strategies  subjects 
were  using  to  solve  icems.  The  major  results  were:  a)  tocal  number 
correct  was  strongly  correlated  with  reference  spatial  tests,  while 
average  total  time  was  weakly  related  co  verbal  reference  tests;  b)  steep 
slopes  were  related  to  verbal  ability  for  the  regressions  of  encoding  time 
on  scimulus  complexity;  rotation  time  on  rocation  and  stimulus  complexity; 
and  construction  time  on  construction,  stimulus  complexity,  and  type  of 
addition;  c)  a  linear  decline  in  correctness  over  the  rocation  facet  was 
related  co  high  scores  on  two  speeded  rocation  reference  tescs  (Cards  and 
Figures),  while  a  linear  decline  in  correctness  over  the  construction 
facet  was  related  to  high  scores  on  two  construction  reference  tests 
(Form  Board  and  Object  Assembly). 

S^It  was  concluded  that  the  results  of  the  experiment  supported  the 
hypothesis  of  speed-level  independence  and  provided  important  insights 
into  Che  nature  of  spatial  ability. 
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Research  on  aptitude  for  learning  has  entered  a  new  era. 
Instructional  studies  have  established  that  individual 
differences  among  learners  often  interact  with  instructional 
treatment  variables  (Cronbach  and  Snow,  1977;  Snow,  1977).  nuch 
of  this  work  has  also  underscored  the  need  for  deeper,  more 
process  oriented  understanding  of  the  psychological  nature  of 
aptitudes.  Cognitive  psychologists  have  begun  the  experimental 
analysis  of  individual  differences  in  information  processing,  and 
there  is  now  reason  to  hope  that  coordination  of  these  lines  of 
work  will  lead  to  process  theories  of  aptitude  for  learning  from 
instruction  (Snow.  1976). 

P.C  e.V  i  g.’J  5.  Approaches  t_g  the  Problem. 

Three  paradigms  have  dominated  previous  attempts  to  bridge 
the  gap  between  trait  and  process  descriptions  of  individual 
differences  in  aptitude. 

1.  Direct  modeling  of  the  cognitive  task,  usually  with 
latency  as  the  dependent  measure.  Much  of  modern 
cognitive  psychology  could  be  classified  here. 
Particularly  relevant  examples  are  the  work  of 
Clark  and  his  associates  on  verbal  comprehension 
(Clark  and  Chase,  1972),  and  Shepard  and  his 
associates  on  mental  rotation  (Shepard  and  Metzler, 

1971;  Shepard  and  Feng,  1972;  Cooper  and  Shepard, 

1976).  Although  the  bulk  of  this  work  ignores 
individual  differences  in  performance,  such  need 
not  be  the  case.  The  most  noteworthy  example  is 
the  extensive  work  of  Sternberg  (1977)  on 
analogical  reasoning.  In  addition  to  examining  a 
number  of  sources  of  individual  differences  in 
cognitive  performance,  Sternberg  also  provides  for 
the  possibility  of  modeling  error  as  well  as 
latency  data . 
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: .  Computer  Simulation.  This  approach  is  exemplified 
in  the  work  of  Simon  and  Kotovsky  (1963)  and 
Kotovsky  and  Simon  (1973)  on  letter  series  items. 

In  simplest  terms,  the  goal  of  this  approach  is  to 
develop  computer  programs  that  solve  various  tasks 
with  the  same  relative  ease  that  human  subjects 
solve  similar  items.  The  program  is  considered  a 
good  model  of  human  behavior  if  item  solution  times 
parallel  those  for  human  subjects. 

3.  Correlating  information  processing  parameters  with 
aptitude  constructs.  This  approach  is  exemplified 
in  the  work  of  Hunt.  Frost,  and  Lunneborg  (1973)  on 
the  relationships  betueen  verbal  and  quantitative 
abilities  and  various  task  parameters.  Similar 
work  with  a  wider  range  of  aptitude  constructs  has 
been  reported  by  Snow,  Marshalek,  and  Lohman 
(1976).  Although  there  are  a  number  of  variants  of 
this  technique,  the  goal  is  to  find  a  set  of 
process  parameters  that,  collectively,  predict  an 
aptitude  construct;  or,  alternately,  a  set  of 
aptitude  constructs  that  predict  a  process 
pa  ramet  er . 

pp.  o_L  Experimental  Task  . 

Most  investigations  of  individual  differences  in  cognitive 
processes  have  attempted  to  model  performance  on  a  task  similar 
to  an  established  test  (e.g.,  Sternberg.  1977;  Simon  and 
Kotovsky,  1963).  Since  individual  differences  on  the  source  test 
usually  have  a  known  network  of  relations  with  other  mental 
tests,  the  investigator  ordinarily  does  not  have  to  establish  the 
construct  validity  of  his  task.  The  main  disadvantages  of  this 
approach  are:  a)  the  sources  of  inter-item  variation  must  first 
be  ’’discovered,"  b)  once  enumerated,  they  must  be  experimentally 
disentangled,  and  c)  these  experimental  manipulations  may  destroy 
the  construct  validity  of  the  test,  which  was  the  main  reason  for 


using  it  in  the  first  place.  Since  the  psychometrically  sound 
test  may  be  a  veritable  hodge-podge  of  test  facets,  uncovering 
all  the  dimensions  of  inter-item  variation  is  frequently  a 
difficult  endeavor.  Further,  once  identified,  it  may  be 
impossible  to  separate  these  dimensions  experimentally.  For 
example,  in  Paper  Folding  (French,  Ekstrom,  and  Price.  1963), 
number  of  folds  is  necessarily  confounded  with  number  of  holes  in 
the  final  unfolded  stimulus. 

An  alternative  approach  is  to  construct  a  new  task  in  which  a 
limited  number  of  design  facets  are  varied  systematically  and,  if 
possible,  orthogonally.  The  disadvantage  of  this  method  is  that 
the  investigator  must  then  establish  the  construct  validity  of 
the  task.  There  is  no  known  network  of  relationships  with  other 
tests  to  fall  back  on. 

The  S-P.eed-leye  1  Problem 
L.jw i  t q t.i.9.n_3.  Previous  rtethods  ■ 

The  most  glaring  deficit  of  previous  research  in  this  area 
has  been  the  unwitting  assumption  that  speed  of  performance  is 
psychologically  the  same  as  power  or  level  of  performance. 

Although  the  distinction  between  speed  and  power  is  not 
synonymous  with  the  difference  between  latency  and  correctness, 
those  who  model  cognitive  performance  using  reaction  time  data 
are  usually  studying  speed  of  performance.  In  any  experiment  of 
this  sort,  it  is  necessary  to  "keep  the  error  rate  down"  in  order 
to  minimize  missing  data.  This  is  usually  done  by  keeping  items 
simple,  thus  eliminating  individual  differences  in  power. 

Ignoring  power  seems  to  have  been  a  concession  to  methodological 
and  statistical  limitations,  not  a  deliberate  choice. 

Focusing  on  speed  of  solving  relatively  simple  items  would 
not  be  a  serious  limitation  if  speed  and  power  reflected  the  same 
psychological  functions.  However  examination  of  the  literature 
on  the  relationship  between  speed  and  power  suggests  that, 
particularly  in  the  area  of  spatial  thinking,  speed  of  solving 
simple  items  is  largely  independent  of  power  in  solving  more 
complex  items  of  the  same  type  (Lohman,  1979a). 
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Per  Imps  the  most  serious  limitation  in  modeling  speed  rattier 
than  level  of  cognitive  performance  arises  from  the  fact  that 
power  m  one  domain  correlates  strongly  with  power  in  other 
cognitive  domains,  and,  more  importantly,  with  external  criteria 
such  as  learning  in  academic  situations.  On  the  other  hand, 
speed  of  solving  a  simple  task  of  one  type  is  largely  independent 
of  speed  of  solving  other  simple  tasks.  Further,  these  various 
indices  of  speed  are  only  slightly  correlated  with  power  in  their 
respective  domains,  or  to  external  variables  such  as  academic 
achievement  (Lohman,  1979a). 

Fas  tojc.  A. DUEL Ly.  tig  Evidence. 

This  relationship  between  speed,  power,  and  complexity 
repeatedly  surfaced  in  a  recent  reanalysis  of  the  major  American 
factor  analytic  research  on  spatial  ability  (Lohman,  1979a).  The 
studies  were  reanalyzed  using  hierarchical  factor  methods  and 
nonmetric  multidimensional  scaling.  Simple,  highly  speeded  tests 
invariably  defined  specific  factors  which  fell  in  the  lower 
stratum  of  a  hierarchical  model  or  at  the  periphery  of  a  two  or 
three  dimensional  non metric  scaling  representation.  Further, 
these  sorts  of  factors  emerged  only  when  extremely  similar  tests 
were  included  in  the  analysis.  On  the  other  hand,  power  tests 
and  their  factors  invariably  fell  in  the  upper  levels  of  the 
hierarchical  model  or  near  the  center  of  the  scaling 
representations.  Tests  that  defined  these  power  factors  were 
complex,  yet  often  dissimilar  in  other  respects. 

The  most  consistent  factorial  split  was  between  complex, 
power  tests  (eg..  Paper  Folding  and  Surface  Development)  that 
defined  a  factor  called  " V i s u a  1 i :a t i o i "  and  simple,  highly 
speeded  tests  (e.g..  Cards  and  Flags)  that  defined  the  "Spatial 
Relations"  factor. 

Finally,  an  important  conclusion  of  this  review  was  that  the 
broad  power  factors  cannot  be  subdivided  into  the  various 
primaries  of  the  Thurstone  (1938)  or  French  (1951;  French. 

Ekstrom  and  Price.  1963)  variety.  Alternately,  second  order 
factors  that  are  extracted  from  a  matrix  of  primary  factor 
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correlations  do  not  coincide  with  corresponding  power  factors 
This  nonhierarchical  nature  of  the  factors  most  likely  reflects 
the  independence  of  speed  and  power  in  cognitive  performance. 

An  important  implication  of  this  is  that  studying  speed  of 
solving  simple  tasks  sheds  little  light  on  the  nature  of  higher 
order  aptitude  constructs  such  as  "intelligence.”  "verbal 
ability”  or  "spatial  ability."  This  is  especially  troublesome 
for  educational  applications,  since  it  is  these  more  general, 
power  aptitudes  which  will  most  likely  interact  with  educational 
treatments . 

The  data  of  Hunt  et  a  1 .  .  (1971)  and  Snow  et  al . .  (1976) 
support  this  argumant.  Both  of  these  investigations  found  that 
paramaters  derived  from  various  cognitive  tasks  were  only  weakly 
related  to  higher  order  aptitude  constructs.  While  some  ol  these 
correlations  were  statistically  significant,  none  were  very 
large. 


S.B  tt.fi  dr-Ii  filial  SJLudjjia  • 

A  review  of  the  literature  on  the  relationship  between  speed 
and  power  (Lohman.  1979a)  indicated  that  the  relationship  is 
probably  moderated  by  the  following  factors: 

1.  Difficulty.  Speed  and  power  probably  correlate 
differently  when  speed  is  measured  over  simple 
tasks  than  when  it  is  measured  over  complex  tasks. 

2.  Content  Area.  The  relationship  is  probably 
different  within  verbal  tests,  within  spatial 
tests,  etc. 

3.  Accuracy.  Correct  responses  are  generally  faster 
than  incorrect  responses  for  moderately  difficult 
to  difficult  itess.  On  simple  items,  however, 
incorrect  respories  are  usually  faster 

9.  Correct  "Yes"  vs.  Correct  "No."  Correct  "yes" 
responses  are  usually  taster  than  correct  "no" 
responses,  although  some  subjects  evidence  the 
opposite  pattern  on  some  spatial  tasks  (Cooper. 

1976),  further,  latency  for  these  two  types  ol 
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correct  responses  may  relate  differently  to 
difficulty,  and  thus  to  complexity. 

5.  Guessing.  The  error  variance  introduced  by  this 
factor  can  seriously  cloud  the  relationship  between 
speed  and  power.  Further,  there  are  differences  in 
willingness  to  guess,  both  between  subjects  and 
within  subjects  across  tasks  and  situations.  The 
effect  of  guessing  is  most  pronounced  in 
experiments  where  a  yes/no  response  is  required. 

6.  Alternative  Solution  Strategies.  Many  tasks  can  be 
solved  in  more  that  one  way.  There  is  some 
evidence  that  particular  tests  (such  as  the 

Gu i 1 f or d -Z immer man  Spatial  Orientation  test)  are 
especially  vulnerable.  Items  where  several 
multiple  choice  alternatives  are  provided  are  also 
particularly  suspect. 

7.  Motivation.  Thorndike  (1926).  Thurs  tone  (  1937)  and 
Furneaux  (1961)  all  agree  that  motivation  (or 
persistence)  can  influence  both  level  and  speed. 

Further,  the  relationships  between  motivation, 
speed  and  power  are  probably  not  linear:  one  can 
literally  Mtry  too  hard." 

Comp  1  ex 1 1  y 

It  uaj  noted  that  the  speed-power  difference  in  spatial 
factors  is  confounded  with  differences  in  test  complexity.  Speed 
is  measured  when  test  items  are  simple,  power  when  items  are 
complex.  But  what  is  complexity? 

It  is  important  to  distinguish  between  stimulus  complexity 
and  processing  complexity.  Stimulus  complexity  has  been  defined 
in  a  number  of  ways.  Garner  found  that  judged  goodness  of  matrix 
patterns  was  highly  correlated  with  equivalence  set  sire  (Garner. 
1979:  Garner  and  Clement,  1963).  Attneave  (1957).  Arnoult  (1960) 
and  Stenson  (1966)  found  that  complexity  ratings  of  random  forms 
were  strongly  correlated  with  the  number  of  turns  or  points  in 
the  form,  the  ratio  of  the  perimeter  squared  to  the  area  of  the 
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form  (compactness),  and  the  variability  of  the  angles  on  the 
perimeter  of  the  form. 

Stimulus  complexity  may  moderate  the  relationship  between 
speed  and  level  only  if  increases  in  stimulus  complexity  affect 
corresponding  increases  in  processing  complexity.  Cooper  (1975) 
and  Cooper  and  Podgorny  (1976)  claim  that  stimulus  complexity 
does  not  affect  the  rate  ol  mental  rotation.  However.  they 
employed  only  well  learned  stimuli  in  their  experiments.  The 
stimulus  complexity  of  unfamiliar  stimuli  may  alter  the  rate  of 
mental  rotation.  This  would  help  explain  why  capital  letters  can 
be  rotated  at  a  much  greater  rate  than  other  stimuli  (see  Cooper 
and  Shepard,  1976.  for  other  explanations  of  this  phenomenon). 
Tinally,  an  important  but  overlooked  possibility  is  that  the 
crucial  aspect  of  stimulus  complexity  in  many  difficult  spatial 
tests  is  not  the  complexity  of  the  stimulus  figure  that  is  given, 
but  rather  the  complexity  of  the  figure  that  must  be  mentally 
constructed  (c.f.»  Shepard  and  Feng.  1972). 

Processing  complexity,  on  the  other  hand,  has  received 
relatively  little  attention.  It  is  most  likely  a  function  of  the 
nature,  number,  and  diversity  of  processes  executed  during  the 
solution  of  a  task.  Certain  processes  may  be  more  complex  than 
others.  Executing  a  process  several  times  may  be  more  difficult 
than  executing  it  once.  Similarly,  switching  between  two 
different  processes  may  be  more  difficult  than  executing  the  same 
process  twice.  Thus,  the  more  complex  spatial  tests  usually 
require  the  subject  to  employ  several  different  processes,  such 
as  mental  folding  plus  mental  construction.  Alternately,  they 
may  require  the  repeated  application  of  one  process,  such  as 
mental  rotation  (e.g.,  as  in  the  Gu i 1 f o rd -Z imme rman  Spatial 
Visualization  test).  Finally,  they  may  require  the  execution  of 
a  relatively  simple  set  of  processes  on  a  complex,  unfamiliar 
stimulus  (e.g.,  as  in  Graham  and  Kendall's  (1948)  Memory  for 
Designs) . 
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U.np.a.g.fcuaa  Difficulty 

The  key  to  the  resolution  of  the  speed-power  problem  is  in 
the  construction  of  psychologically  meaningful  indices  of  item 
difficulty.  This  is  perhaps  best  accomplished  by  equating 
difficulty  with  complexity  of  processing  requirements  and  then 
constructing  information  processing  models  that  account  for  the 
various  dimensions  of  task  complexity. 

A  task  like  Block  Rotation  (Shepard  and  Metzler.  1971) 
provides  a  good  example  of  how  the  term  "item  difficulty”  can  be 
unpacked  in  a  psychologically  meaningful  way.  The  usual 
procedure  for  determining  item  difficulty  is  to  compute  the 
proportion  of  the  sample  passing  (or  failing)  each  item. 

However,  since  only  a  handful  of  subjects  pass  the  most  difficult 
items,  the  correlation  between  speed  and  power  would  be  based  on 
the  entire  sample  for  the  easiest  item  and  on  no  subjects  for  the 
most  difficult  item.  Another  limitation  is  that  these  group 
statistics  may  not  apply  at  the  individual  level.  Uhat  is 
difficult  for  one  subject  may  be  relatively  easy  for  another. 

At  the  individual  level,  item  difficulty  should  be  related  to 
both  correctness  and  latency.  Those  items  that  the  subject 
solves  correctly  are  obviously  easier  for  him  than  those  he 
fails.  Within  the  set  of  correctly  answered  items,  it  is  likely 
that  it  will  take  him  longer  to  solve  the  items  that  he  finds 
more  difficult.  However,  difficulty  is  then  defined  in  terms  of 
latency,  and  so  the  two  cannot  be  separated. 

Even  if  the  group  difficulty  scale  were  a  reasonable 
approximation  for  most  individuals,  one  would  still  be  faced  with 
the  problem  of  determining  why  some  items  were  more  difficult 
than  others.  If  inter-item  differences  in  processing  complexity 
were  unidimensional  (e.g.,  items  differ  only  in  the  degree  of 
angular  separation  between  the  two  stimulus  figures)  then  there 
is  a  good  chance  that  one  could  "discover"  this  dimension  by 
examining  the  group  difficulty  scale.  However,  if  the  inter-item 
complexity  is  two.  three  or  four  dimensional,  there  is  little 
chance  that  one  could  intuit  these  dimensions  from  an  inspection 
of  the  group  difficulty  scale.  This  becomes  obvious  when  one 
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realizes  that  there  may  be  interactions  between  the  different 
complexity  facets,  and  interactions  between  subjects  and  the 
various  complexity  facets.  Further,  it  is  highly  unlikely  that 
all  facets  of  complexity  present  in  the  test  will  be  represented 
in  an  orderly  (lactonal)  fashion.  The  ps ychomet r i ca  1 1  y  sound 
test  may  be  a  veritable  hodge-podge  of  facets. 

Therefore,  rather  than  attempting  to  analyze  a  complicated 
existing  test,  it  would  appear  more  reasonable  to  start  from  the 
other  end,  by  constructing  a  multi-faceted  test.  For  example, 
one  could  vary  the  dimensionality  of  the  stimulus  figures  (two 
versus  three);  the  number  of  rotations  required,  in  addition  to 
the  angular  value  of  each  rotation  (as  in  the  Gui 1 f ord-Zimmerman 
Spatial  Visualization  test);  and  the  like. 

Returning  to  the  block  rotation  task,  assume  that  item 
complexity  is  entirely  a  function  of  the  angular  separation 
between  the  two  stimulus  figures.  If  item  difficulty  is  captured 
by  this  un i d i mens i ona 1  scale,  then  the  following  conditions 
should  hold ; 

1.  For  the  group  data,  both  mean  latency  for  each  item 
type  and  proportion  of  the  total  group  failing  each 
item  type  should  be  mono t on i ca 1 1 y  increasing 
functions  of  complexity  (i.e.,  angular  separation). 

Z.  For  the  individual  data,  average  (within  subject) 
latency  of  correct  responses  for  each  item  type 
should  be  a  monotonical ly  increasing  function  of 
complexity.  Further,  individuals  should  not  skip 
points  on  the  scale.  Thus,  within  the  limits  of 
measurement  error,  if  an  individual  correctly 
solves  item  type  k  he  must  also  have  solved  all 
previous  item  types  1  to  k-1. 

Assuming  that  the  regression  of  log  time  on  angular 
separation  is  linear,  it  is  then  possible  to  compute  these 
regressions  for  the  group  and  individual  data.  Both  the  slopes 
and  intercepts  should  be  positive  for  the  group  data  as  well  as 
for  each  subject. 

If  either  parameter  is  significantly  negative  for  any 
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subject.  it  suggests  that  the  information  processing  model  is 
invalid  lor  that  subject.  Further,  the  intercept  must  be 
significantly  different  from  zero,  while  the  slope  may  be  zero. 

In  the  case  of  the  intercept,  this  is  because  processing  even  the 
simplest  item  must  take  some  time.  On  the  other  hand,  the  slope 
may  be  zero  if  it  takes  a  subject  as  long  to  process  a  complex 
item  as  it  does  to  process  a  relatively  simple  item.  However, 
this  also  implies  that  the  facet  in  question  is  not  a  meaningful 
aspect  of  complexity  for  that  subject. 

The  correlation  between  the  intercepts  and  the  lengths  of  the 
regression  lines  yields  the  correlation  between  speed  of 
performing  the  easiest  item  types  and  the  level  reached  on  more 
difficult  item  types.  The  individual  regression  lines  can  then 
be  projected  so  that  they  all  extend  to  the  point  of  maximum 
complexity.  At  this  point  the  correlation  between  these  (for  the 
most  part,  predicted)  latencies  and  the  known  lengths  of  the 
regression  lines  yields  the  best  estimate  of  the  relationship 
between  speed  of  performing  complex  tasks  and  level  of 
performance  on  those  tasks.  Projected  or  known  latencies  at 
intermediate  points  on  the  scale  can  also  be  correlated  with 
level  to  yield  intermediate  values  of  the  speed-level 
correlation. 

When  formulated  in  this  manner,  it  is  obvious  that  the 
correlation  between  speed  and  power  will  remain  constant  over  the 
range  of  item  complexity  only  if  there  are  no  individual 
differences  in  the  regression  slopes.  This  has  important 
implications  both  for  the  speed-power  problem  and  for  the 
g ene r a  1 i za b i 1 i t y  of  information  processing  parameters  derived 
from  simple  tasks.  If  individual  regression  slopes  are  parallel, 
then  the  relationship  between  speed  and  power  is  constant 
throughout  the  range  of  complexity  represented  in  the  analysis. 
Parallel  regression  slopes  also  imply  that  individual  differences 
in  speed  of  solving  simple  tasks  generalizes  to  the  speed  of 
solving  complex  tasks  of  the  same  type. 

On  the  other  hand,  if  there  are  individual  differences  in 
slopes,  then  speed  of  solving  simple  tasks  does  not  generalize  to 
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speed  of  solving  complex  tasks.  In  either  case,  the  relationship 
between  speed  and  power,  whether  constant  (i.e.,  slopes  constant) 
or  variable  tie.,  slopes  differ),  is  the  crucial  issue.  A 
reasonable  prediction  in  the  area  of  spatial  visualization  tasks 
would  be  that  there  are  individual  differences  in  slopes,  and 
that  the  relationship  between  speed  and  level  is  higher  for 
complex  tasks  than  for  simple  tasks. 


y ix-icti ii.  ai  IntUv vdual  JU tlargncea  m  SmLui.  Ability 
C.a.cxe  lJlic.ru.  I  Sturt  m 

Given  the  importance  of  the  speed-power  problem,  the  need  for 
a  method  for  studying  it,  and  the  relevance  of  factorial ly 
designed  experiments  where  complexity  is  manipulated  in  a 
systematic  manner,  the  next  step  is  to  select  the  design  facets. 
This  presumes  some  knowledge  of  the  important  dimensions  of  the 
aptitude  construct  in  question. 

At  the  most  basic  level,  spatial  thinking  requires  (among 
other  things)  the  ability  to  encode,  remember,  transform,  and 
match  spatial  stimuli.  Factors  such  as  Closure  Speed  (i.e.. 
speed  of  matching  incomplete  visual  stimuli  with  their  long  term 
memory  representations),  Perceptual  Speed  (speed  of  matching 
visual  stimuli),  Visual  Memory  (short  term  memory  for  visual 
stimuli)  and  Kinesthetic  (speed  of  making  left-right,  up-down 
discriminations)  may  represent  individual  differences  in  the 
speed  or  efficiency  of  these  basic  cognitive  processes.  However, 
these  factors  surface  only  when  extremely  similar  tests  are 
included  in  a  test  battery.  Such  tests  and  their  factors 
consistently  fall  near  the  periphery  of  a  scaling  representation, 
or  at  the  bottom  of  a  hierarchical  model. 

While  the  processes  that  these  factors  hypothet ica 1 1 y 
represent  are  certainly  spatial  in  nature,  they  are  not  usually 
the  referent  of  the  term  "spatial  ability."  While  a  number  of 
spatial  factors  have  been  identified,  only  three  survive  a  more 
rigorous  examination  using  hierarchical  factor  methods  (see 
tollman.  1979a).  All  of  the  factors  involve  mental 
transformation.  They  are; 
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Spatial  Relations.  This  factor  is  defined  by  tests 
such  as  Cards.  Flags  and  Figures  (Tliurstone,  1938). 
It  probably  represents  individual  differences  in 
the  speed  of  mentally  rotating  or  reflecting  a 
simple  visual  stimulus.  The  tests  that  define  this 
factor  are  all  parallel  forms  of  one  another,  and 
the  factor  emerges  only  if  these  or  highly  similar 
tests  are  included  in  the  battery. 

Spatial  Orientation.  This  factor  appears  to 
involve  the  ability  to  imagine  how  a  stimulus  array 
uill  appear  from  another  perspective.  In  the  true 
spatial  orientation  test,  the  subject  must  imagine 
that  he  is  reoriented  in  space,  and  then  make  some 
judgment  about  the  situation.  There  is  often  a 
left-right  discrimination  component  in  these  tasks, 
but  this  discrimination  must  be  made  from  the 
imagined  perspective.  However,  the  factor  is 
difficult  to  measure  since  tests  designed  to  tap  it 
are  often  solved  by  mentally  rotating  the  array 
rather  than  reorienting  an  imagined  self. 
Visualization.  The  factor  is  represented  by  a  wide 
variety  of  tests  such  as  Paper  Folding.  Form  Board. 
WflIS  Block  Design.  Hidden  Figures.  Copying,  etc. 

The  tests  that  load  on  this  factor,  in  addition  to 
their  spatial-figural  content,  share  two  important 
features:  a)  they  are  all  administered  under 

relatively  unspeeded  conditions,  and  b)  most  are 
much  more  complex  than  corresponding  tests  that 
load  on  the  more  peripheral  factors.  Tests 
designed  to  measure  this  factor  usually  fall  near 
the  center  of  a  two  dimensional  scaling 
representation,  and  are  often  quite  close  to  tests 
of  Spearman's  "g"  (such  as  Raven  Matrices  or  Figure 
Classification)  or  Cattell's  (1963)  Gf. 

examining  the  type  of  tests  that  load  on  these  three 


spatial  factors,  two  major  types  of  mental  transformation  appear 
to  be  involved.  The  first  is  mental  movement.  Reflecting, 
rotating,  folding,  or  simply  imagining  that  a  stimulus  is  moved 
from  one  position  in  an  array  to  another  position,  are  all 
varieties  of  mental  movement. 

The  second  type  of  mental  transformation  may  be  called 
construction.  There  are  two  types  of  constructions: 
reproduction  (i.e.<  physical  construction)  and  combination  (i.e.. 
mental  construction).  At  the  simplest  level,  reproduction  is 
represented  in  tests  like  Thurstone’s  (1938)  Copying,  where  the 
subject  must  correctly  copy  a  stimulus  design.  At  the  next 
level,  it  is  represented  by  tests  like  Graham  and  Kendall's 
(1948)  Memory  for  Designs,  where  the  design  must  be  reproduced 
from  memory.  One  important  difference  between  these  tests  and 
those  that  load  on  more  peripheral  factors  like  Perceptual  Speed 
or  Memory  Span  is  that  the  design  must  be  reproduced,  not  just 
recognised,  and  the  reproduced  design  must  be  a  veridical 
representation  of  the  stimulus.  Retaining  a  veridical  mental 
image  of  a  design  may  be  an  important  component  of  other  complex 
spatial  tasks,  such  as  Hidden  Figures  (French  et  al..1963). 

In  the  mental  construction  tasks,  on  the  other  hand,  the 
subject  must  actually  construct  a  mental  image,  usually  by 
reorganising  the  stimulus  in  a  new  way.  The  clearest  examples  of 
this  sort  of  process  are  tests  such  as  Form  Equations  (El  Koussy. 
1935)  and  Paper  Form  Board  (eg..  Thurstone  1938;  French.  Ekstrom 
and  Price.  1963).  Mental  construction  is  an  important  component 
of  many  complex  spatial  tests.  For  example,  in  Surface 
Development  (French  et  al..1963)»  the  task  is  not  simply  to  fold 
the  sides  of  the  figure  mentally  (i.e..  mental  movement ) .  but 
rather  to  construct  a  three  dimensional  figure  from  a  two 
dimensional  drawing.  Similarly,  in  Paper  Folding  (French  et 
al..1963).  the  examinee  must  construct  new  holes  as  he  mentally 
unfolds  the  stimulus.  Finally,  mental  construction  may  take  the 
form  of  mentally  deleting  parts  of  a  stimulus,  as  in  Match 
Problems  (Guilford  and  Hoepfner.  1971).  This  may  also  be  an 
important  component  of  tests  such  as  Embedded  Figures  (Mitkin. 


0  1  t  m  a  n  ,  Raskin  and  Karp.  1971)  or  Hidden  Figures  (French  et  a  1  .  . 

1  96  3  )  . 
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Until  the  pioneering  uork  o!  Shepard  and  his  colleagues, 
evidence  on  the  nature  of  spatial  transformation  processes  was 
limited  to  the  speculations  of  factor  analysts  on  the  nature  of 
their  factors  Several  important  insights  into  the  nature  of 
spatial  thinking  have  emerged  from  this  work.  The  first 
significant  finding  uas  that  the  time  required  to  determine 
whether  two  stimuli  were  identical  uas  a  linear  function  of  the 
angular  separation  between  them  (Shepard  and  Metzler.  1971). 

This  result  uas  replicated  in  later  studies  using  alphanumeric 
characters  (Cooper  and  Shepard.  1973a.  1973b).  random  two 

dimensional  polygons  (Cooper  and  Podgorny.  1976;  Cooper.  1975). 
and  two  dimensional  stimuli  that  had  to  be  mentally  folded  into 
three  dimensional  cubes  (Shepard  and  Feng,  1972).  Other 
important  findings  uere;  a)  the  function  relating  time  to 
perform  "in  depth"  rotations  to  the  angular  separation  between 
figures  was  essentially  the  same  as  that  for  picture  plane 
rotations  (Shepard  and  fletzler.  1971);  b)  the  time  required  to 
prepare  for  an  object  in  a  specified  orientation  uas  a  linear 
function  of  the  angular  separation  between  the  specified 
orientation  and  the  normal  or  trained  orientation  (Cooper  and 
Shepard.  1973a.  1973b):  c)  the  rate  of  rotation  for  well  learned 

complex  figures  uas  essentially  the  same  as  that  for  simple 
figures  (Cooper  and  Podgorny.  1976);  and  d)  when  experimental 
conditions  were  equated,  there  were  no  differences  between  the 
rate  of  rotation  for  two  and  three  dimensional  stimuli  (Podgorny 
1975).  These  findings  place  important  limitations  on  the  type  of 
models  that  may  be  advanced  to  account  for  the  phenomenon.  In 
particular,  Shepard  and  his  colleagues  have  argued  that  the 
internal  processes  and  representations  underlying  the  rotation 
task  are  "analog"  in  nature  (Cooper  and  Shepard.  1973b;  fletzler 
and  Shepard.  1974;  Shepard.  1975). 

Experimental  evidence  on  the  nature  of  mental  construction 
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processes  is  less  extensive.  Palmer  (1974)  found  that  the  time 
to  synthesise  a  stimulus  mentally  was  inversely  related  to  the 
"goodness”  of  the  component  parts.  However,  there  were  important 
individual  differences  in  discriminative  reaction  time  to  the 
test  stimulus,  indicating  that  some  of  the  subjects  may  not  have 
completely  synthesised  the  component  parts.  Glushko  and  Cooper 
(1975)  used  a  different  paradigm  to  investigate  mental 
construction.  They  asked  subjects  to  construct  a  mental  image  of 
a  composite  figure  on  the  basis  of  a  verbal  description  of  the 
component  parts  Construction  time  increased  with  the  number  of 
component  parts,  but  discriminative  reaction  time  to  a  test 
figure  was  independent  of  figural  complexity. 

■Lni  im-iUAl  EUL1  e.r  gases  ui  the  Exper  lmen  t  a  1  S  t  ud  i  es  . 

Although  most  experimental  paradigms  relegate  individual 
differences  to  the  error  term,  individual  differences  in 
performance  on  spatial  tasks  have  been  recognised  to  be  striking 
enough  that  investigators  are  now  forced  to  deal  with  them.  Even 
within  small,  select  samples,  the  estimated  rates  of  mental 
rotation  for  individual  subjects  have  differed  by  a  factor  of  two 
or  three  (Cooper  and  Shepard,  1976).  There  are  important 
individual  differences  in  discriminative  reaction  time  to  the 
test  stimulus  as  well.  Cooper  (1976)  (see  also  Cooper  and 
Podgorny.  1976)  found  that,  for  some  subjects,  "same"  responses 
were,  on  the  average,  faster  than  "different"  responses. 

Further,  the  speed  of  "different"  responses  was  not  related  to 
the  similarity  of  the  test  stimulus  to  the  standard.  For  other 
subjects,  "different"  responses  were  generally  faster  than  "same" 
responses,  and  "different"  reaction  time  decreased  as  the 
standard  and  test  stimuli  became  increasingly  dissimilar.  Cooper 
(1976)  speculates  that  these  patterns  may  reflect  important 
differences  in  the  nature  of  the  comparison  processes. 

Finally,  there  are  important  individual  differences  in  the 
ability  to  solve  the  more  difficult  spatial  tasks  in  the  first 
place.  For  example,  of  the  ten  Stanford  undergraduates  who 
participated  in  the  Shepard  and  Feng  (1972)  experiment,  three  had 
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error  rates  so  high  (16-29%)  that  their  data  were  discarded. 
Further,  the  correlation  (for  the  seven  remaining  subjects) 
between  the  number  of  errors  and  mean  reaction  time  on  the  40 
error  free  items  was  -.28.  uhich  is  consistent  with  tne  previous 
argument  here  regarding  the  relative  independence  of  speed  and 
level  . 

Solution  Strategies 

Individual  differences  in  solution  strategy  are  a  major 
problem  for  both  experimental  and  correlational  studies  of 
spatial  thinking.  While  some  subjects  solve  verbal  problems  such 
as  anagrams,  syllogisms,  and  three  term  series  problems  using  a 
predominately  spatial  strategy,  it  is  possible  to  construct 
verbal  tasks  where  spatial  strategies  would  be  of  little  or  no 
assistance  (e.g.,  a  simple  vocabulary  test).  On  the  other  hand, 
it  is  extremely  difficult  to  devise  spatial  tasks  that  cannot  be 
solved  by  some  nonspatial  strategy. 

Further,  subjects  often  change  strategies  as  they  become  more 
familiar  with  the  task,  or  as  item  difficulty  increases  (Meyers, 
1958;  Barrat.  1953;  Lohman,  1977a).  Careful  selection  of  the 
experimental  task  and  open  response  format  can  eliminate  some 
strategy  shifts.  Although  retrospective  and  introspective 
reports  are  often  of  dubious  validity,  they  can  be  obtained,  and 
may  suggest  at  least  major  individual  differences  in  strategy  or 
radical  shifts  in  strategy  over  items. 

However,  relationships  between  individual  differences  in 
patterns  of  scores  over  facets  and  reference  constructs  can 
provide  more  direct  evidence  on  the  nature  of  overall  strategies 
and  shifts  in  strategy  over  items.  For  example,  the  correlation 
between  the  pattern  of  latency  cell  means  over  a  rotation  facet 
and  reference  variables  may  suggest  that  steep  slopes  are  related 
to  verbal  ability  and,  by  implication,  verbal  strategies.  Other 
mean  contrasts  may  be  formulated  to  address  both  general  and 
specific  hypotheses  about  strategies,  such  as: 

1.  Do  changes  in  difficulty  effect  overall  changes  in 

a 

s peed -accu racy  tradeoff? 
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2.  Uhat  is  the  relationship  between  individual 
differences  in  s peed -accu r ac y  tradeoff,  overall  or 
over  a  particular  facet,  and  reference  variables? 

3.  Uhat  are  the  relationships  between  individual 
differences  in  the  patterns  of  cell  means  for 
latency  or  correctness  over  a  particular  facet  and 
reference  variables? 

Examination  of  the  relationships  between  individual 
differences  in  the  patterns  of  scores  over  design  facets  and 
reference  tests  can  provide  a  powerful  method  for  exploring 
intra-  and  inter-subject  variation  in  solution  strategy  (Calfee, 
1976  )  . 


Methodological  Issues 
Limitations  q±  Tests  and  factors  ■ 

Since  correlations  between  task  parameters  and  reference 
tests  and  factors  constitute  a  major  portion  of  the  results  of 
this  experiment,  some  comments  on  the  advantages  and  limitations 
of  such  data  are  in  order.  Uhile  it  is  important  to  demonstrate 
that  performance  on  the  experimental  task  is  related  to 
performance  on  reference  tests  or  factors,  such  correlation  (or 
lack  of  it)  cannot  be  considered  the  final  arbiter  of  construct 
validity.  There  are  good  tests  and  bad  tests.  Many  tests,  and 
especially  instruments  intended  ohly  for  research  (e.g.,  French. 
Ekstrom  and  Price.  1963)  are  not  carefully  constructed,  even  by 
psychometric  standards.  Further,  most,  if  not  all,  test  items 
can  be  solved  in  more  than  one  way.  This  is  particularly  true  of 
spatial  tests.  For  example,  only  seven  subjects  in  this 
experiment  reported  that  they  always  rotated  the  stimuli  on  the 
Cards  and  Figures  tests.  Five  reported  that  they  never  rotated, 
uhile  most  (13)  reported  rotating  some  stimuli  and  using  logical 
cues  to  solve  other  items.  Data  were  unavailable  for  six 
subjects.  Thus,  it  is  unreasonable  to  expect  large  correlations 
between  performance  on  these  tests  and  performance  on  rotation 
items  in  the  experimental  task. 

Similarly,  factors  representing  common  covariation  in 


clusters  of  tests  can  be  shadow  or  substance.  Some  of  the  more 
specific  factors  (e.g..  Spatial  Relations,  Closure  Speed. 
Perceptual  Speed)  are  the  product  of  including  what  amounts  to  a 
parallel  forms  reliability  coefficient  in  the  matrix  of  test 
intercorrelations.  Such  factors  regularly  disappear  when  the 
tests  are  altered  to  remove  method  covariance.  Further,  factor 
analysis  assumes  that  subjects  are  not  solving  tests  in  different 
ways.  Correlations  and  factor  patterns  can  change  drastically 
when  data  are  analyzed  within  distinguishable  strategy  groups 
(French.  1965;  Lohman.  1979a). 

Thus,  correlations  between  total  scores,  cell  totals,  or 
individual  contrast  scores  and  reference  tests  can  provide  useful 
but  limited  evidence  on  the  construct  validity  of  the  score  or 
contrast.  The  importance  of  reference  tests  lies  in  the  link,  no 
natter  how  weak  or  uncertain  it  may  be.  with  the  network  of 
individual  difference  constructs  established  through  fifty  years 
of  correlational  research  on  human  abilities. 

Two  more  specific  limitations  of  the  reference  test 
correlations  should  also  be  noted.  First,  the  30  subjects  who 
participated  in  this  experiment  were  deliberately  selected  to 
represent  a  wide  range  of  verbal  and  spatial  abilities. 
Correlations  between  task  parameters  and  reference  tests  could  be 
quite  different  in  a  more  restricted  sample. 

Second,  subjects  apparently  interpreted  the  instruction  to 
"solve  each  item  as  quickly  as  you  can"  in  quite  different  ways. 
While  the  type  of  analysis  performed  provided  considerable 
insight  into  the  effects  of  speed -  accuracy  tradeoffs,  the  results 
of  the  experiment  could  change  substantially  if  subjects  were 
somehow  induced  to  perform  as  rapidly  as  they  could  without 
sacrificing  accuracy. 

Finally,  it  must  be  emphasized  that  this  experiment  was 
designed  to  investigate  the  speed-level  hypothesis.  Thus,  even 
though  considerable  evidence  on  the  nature  of  individual 
differences  in  spatial  thinking  emerged  from  the  analysis,  the 
study  was  not  designed  to  investigate  such  hypotheses  rigorously. 


IS 


Contrast 


Uhen  subject-factor  interactions  are  large,  plots  of  mean 
scores  may  be  misleading.  This  is  especially  true  uhen  the 
s u b j ec t - f ac t o r  interaction  is  large  and  significant,  and  the 
factor  itself  is  nonsignificant. 

Subject  variability  in  the  dependent  measure  over  a 
particular  design  facet  may  be  examined  by  computing  contrast 
scores  for  each  subject  and  then  correlating  these  scores  with 
overall  performance  a.nd  reference  tests  (Calfee,  1976).  Examples 
of  procedure  for  computing  individual  contrast  scores  are 
presented  elsewhere  (see  Lohman,  1979b). 

C-ans Unions  ■ 

Previous  research  on  the  nature  of  aptitude  processes  has 
ignored  the  important  differences  between  speed  and  level  of 
performance.  In  particular,  the  speed  of  correctly  solving 
simple  items  is  largely  independent  of  level  attained  on  more 
conplex  items  of  the  same  type.  Failure  to  recognize  this  fact 
has  produced  considerable  confusion  in  the  factor  analytic 
literature,  and  in  factor  models  of  human  abilities.  Likewise, 
it  has  resulted  in  constructs,  models,  and  parameters  in 
experimental  psychology  that  are  of  questionable 
g ener a  1 i zab i 1 i t y ,  particularly  for  process  understandings  of 
constructs  such  as  "verbal  ability,"  "spatial  ability,"  and 
"Intel lig  enc  e . " 

The  key  to  the  resolution  of  the  speed-power  problem  is  the 
construction  of  psychologically  meaningful  indices  of  item 
difficulty.  This  is  perhaps  best  accomplished  by  equating 
difficulty  with  complexity  of  processing  requirements,  and  then 
constructing  information  processing  models  that  account  for  the 
various  dimensions  of  task  complexity. 

Within  the  area  of  spatial  abilities,  the  two  major 
complexity  facets  are  construction  and  mental  movement. 
Construction  may  be  divided  into  physical  construction  (i.e., 
reproduction)  and  mental  construction.  nental  movement  includes 
rotation,  reflection,  folding,  and  transposition.  Complex 
spatial  tasks  usually  require  more  than  one  of  these  processes. 


such  as  folding  and  mental  construction.  Other  complex  spatial 
tasks  require  either  the  repeated  application  of  a  single  process 
(e.g.f  mental  rotation),  or  the  execution  of  one  process  on  a 
complex,  unfamiliar  stimulus. 

QvacvisH  si  study. 

The  present  experiment  uas  designed  to  explore  individual 
differences  in  spatial  ability,  with  particular  emphasis  on  the 
interrelationships  of  speed,  level  and  complexity,  and  their 
relationships  with  aptitude  reference  constructs.  Complexity  was 
represented  in  two  ways:  stimulus  complexity  and  processing 
complexity.  Stimulus  complexity  was  determined  by  the  number  of 
points  on  both  the  given  and  to-be-constructed  stimuli. 

Processing  complexity  was  manipulated  by  increasing  the  "amount” 
of  a  particular  operation  (i.e.,  degrees  of  rotation  or  number  of 
stimulus  pieces  that  are  added  together)  as  well  as  the  variety 
of  operations  (construction  and  rotation  alone  versus 
construction  plus  rotation). 

Mental  movement  was  represented  by  mental  rotation,  in  the 
tradition  of  Shepard  and  his  colleagues  (Shepard  and  Metzler. 

1971:  Cooper  and  Podgorny.  1976).  Construction  was  represented 
in  two  ways:  a)  by  requiring  the  subject  to  combine  two  or  three 
stimulus  figures  mentally,  and  b)  by  asking  him  to  draw  the 
combined,  rotated,  or  combined  and  rotated  stimulus  figure. 

The  major  hypotheses  of  the  investigation  were: 

1.  Speed  of  performing  simple  spatial  tasks  is  largely 
independent  of  the  level  of  such  tasks  that  a 
subject  can  ultimately  solve. 

2.  Level  scores  correlate  more  strongly  than 
corresponding  speed  scores  with  higher-order 
aptitude  constructs  such  as  G,  Gf,  and  Vz. 

The  present  study  also  provided  an  opportunity  to  study  the 
relationships  between  individual  differences  in  the  patterns  of 
latency  and  correctness  cell  means  over  design  facets  and 
reference  aptitude  constructs. 


CHAPTER  JLL 
B.ESISN  AUB  PROCEDURE 


Design 

There  were  four-major  components  in  the  design  of  this 
experiment:  the  experimental  task,  the  drau  trials,  the  simple 

reaction  time  control,  and  the  reference  constructs. 

Experimental  Task 

The  design  of  the  experimental  task  is  shown  in  Figure  2.1. 
All  design  facets  were  fully  crossed  except  level  one  of  the 
construction  facet,  which  was  not  crossed  with  type  of  addition 
or  product  complexity. 

Levels  of  the  construction  facet  refer  to  the  number  of 
stimulus  figures  presented  during  a  trial.  When  more  than  one 
stimulus  was  presented,  later  stimuli  were  to  be  added  to  the 
initial  stimulus  in  a  prescribed  manner  to  construct  a  new 
stimulus  . 

Levels  of  the  rotation  facet  indicate  the  number  of  degrees 
through  which  the  subject  was  required  to  rotate  the  given  or 
constructed  stimulus.  When  both  construction  and  rotation  were 
required  on  a  given  trial,  construction  always  preceded  rotation. 

Stimulus  complexity  refers  to  the  average  number  of  points 
on  the  stimulus  figures  presented  during  a  trial.  The  levels 
were  low  (3-4  points),  medium  (5-6  points),  and  high  (7-8 
points  )  . 

Match  refers  to  whether  a  correct  or  incorrect  test  stimulus 
was  presented  at  the  conclusion  of  the  trial. 

Type  of  addition  refers  to  whether  the  to-be-added  stimuli 
were  added  to  the  right  (level  1)  or  the  left  (level  2)  of  the 
initial  stimulus. 

Product  complexity  refers  to  the  number  of  points  on  the 
constructed  stimulus.  Again,  low  meant  3  or  4  points  and  high  7 
or  8  points. 

Since  the  entire  design  was  replicated  within  each  subject, 
subjects  may  be  considered  the  seventh  fully  crossed  factor. 
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ficaw  Trials 


The  draw  trials  were  chosen  by  taking  a  one  quarter  fraction 
of  a  2^  factorial  design  in  uhich  the  match  facet  uas  eliminated 
and  all  three  level  factors  were  converted  to  tuo  level  factors. 
This  uas  accomplished  by  ignoring  level  one  of  the  construction 
facet,  defining  a  rotate  and  no  rotate  factor  and  ignoring  level 
2  (medium  complexity)  on  the  stimulus  complexity  factor. 

The  quarter  fraction  of  this  design  defined  16  cells.  One 
item  uas  constructed  for  each  no  rotation  cell  and  tuo  for  each 
rotation  cell,  one  for  both  90  and  180  degree  rotations. 

Further,  magnitude  of  rotation  uas  counterbalanced  with  the  four 
values  of  product  complexity  (3, 4, 7, 8)  uithin  each  level  of  the 
construction  facet. 

In  addition  to  these  24  items  (eight  from  the  no  rotation 
cells  and  16  from  the  rotation  cells),  six  items  uere  constructed 
to  represent  the  crossing  of  rotation  and  stimulus  complexity  at 
level  one  of  the  construction  facet.  The  complete  set  of  drau 
trials  thus  contained  30  items. 

SimaLfi  Rgact.i.pn  Lung  Co  n  t  r  o  1 

This  condition  consisted  of  three  consecutive  blocks  of  20 
true-false  color  discrimination  trials.  The  ten  true  and  ten 
false  trials  uere  separately  randomized  uithin  each  block. 

Constructs 

Since  the  major  goal  of  the  study  uas  to  examine  individual 
differences  in  spatial  thinking,  the  selection  reference 
constructs  constituted  an  important  aspect  of  the  overall  design. 
Tests  representing  general  intelligence.  Gf.  Gc .  Visualization, 
Perceptual  Speed,  Closure  Speed  and  Memory  Span  uere  already 
available  for  these  subjects.  Additional  tests  uere  chosen  to 
tap  particular  aspects  of  individual  differences  in  spatial 
thinking  not  clearly  represented  in  any  of  the  previous  reference 
tests.  Cards  (French,  Ekstrom  and  Price,  1963)  and  Figures 
(Thurstone.  1938)  uere  chosen  to  represent  the  Spatial  Relations 
factor.  Copying  (French.  Ekstrom  and  Price,  1963)  and  Memory  for 
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Designs  (Graham  and  Kendall.  1948)  were  chosen  to  represent 
rudimentary  aspects  of  spatial  construction  (see  Chapter  I).  A 
complete  list  of  the  reference  battery,  with  a  brief  description 
and  example  item  from  each  test  is  reported  in  Lohman  (1979b). 

Aas.i.qpwsgjt  a±  Hauls,  la  Blocks 

The  experimental  task  was  organized  into  five  blocks.  The 
first  block  contained  the  60  reaction  time  control  trials.  The 
next  four  blocks  were  composed  of  experimental  and  draw  trials. 
The  216  experimental  trials  were  first  randomly  assigned  to  one 
of  the  four  blocks  under  the  constraint  that  all  main  effects  be 
kept  independent  of  block.  Since  there  was  only  one  observation 
per  cell  at  levels  2  and  3  of  the  construction  facet,  some 
confounding  of  interaction  vectors  and  block  was  inevitable.  The 
degree  of  confounding  was  determined  by  correlating  vectors 
representing  the  linear  and  quadratic  components  of  blocks  (in 
the  order  administered  to  all  subjects)  with  the  design  vectors. 
For  both  linear  and  quadratic  block  vectors  only  five 
correlations  with  design  vectors  were  greater  than  .20.  For  the 
linear  component,  only  one  correlation  was  above  .25  (KRST2. 
r*.27).  For  the  quadratic  component,  two  correlations  were  above 
.25  (RSMP1.  r  = . 27 ;  and  KRSHTP2,  r*.27).  Thus,  even  for  the 
design  vectors  most  strongly  correlated  with  block,  only  7’i  of 
the  variance  accounted  for  by  those  vectors  may  be  attributed  to 
linear  or  quadratic  effects  over  blocks. 

The  30  draw  trials  were  also  randomly  assigned  to  one  of  the 
four  blocks,  for  totals  of  62,  61,  61.  and  62  items  in  blocks  one 
through  four  respectively.  Items  were  then  randomly  ordered 
within  each  block  before  filming. 

Subjects 

Subjects  were  selected  from  an  initial  reference  population 
of  241  Palo  Alto  high  school  students  and  123  Stanford  University 
undergraduates.  A  large  reference  battery  was  administered  to 
these  subjects  in  the  Spring  of  1976.  Most  had  participated  in 
other  experiments  conducted  by  the  Aptitude  Project.  Of  the 
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original  171  males,  158  uere  available  for  the  present 
experiment.  Those  who  had  participated  in  similar  experiments 
uere  eliminated.  Thirty-one  subjects  uere  then  selected  from  the 
remaining  sample  on  the  basis  of  composite  spatial  and  verbal 
scores.  Tests  uere  selected  for  each  of  the  three  composites  on 
the  basis  of  a  uithin  sex  factor  analysis  of  the  entire  reference 
battery.  The  first  spatial  composite  uas  formed  by  normalizing 
and  then  summing  scores  for  Paper  Folding,  Surface  Development. 
Hidden  Figures  Call  from  French,  Ekstrom  and  Price,  1963),  Blocl 
Design,  Picture  Arrangement  (both  from  Uechsler.  1955)  and  the 
advanced  Raven  Progressive  Matrices  (Raven,  1962).  This 
composite  is  akin  to  Cattell's  (1963)  Gf  or  Zimmerman’s  (1953) 

Vz.  The  second  spatial  composite  uas  formed  by  normalizing  and 
then  summing  each  subject’s  scores  on  Paper  Form  Board  (French, 
Ekstrom  and  Price,  1963)  and  Object  Assembly  (Uechsler,  1955). 
Although  the  factor  analytic  evidence  for  the  separation  of  these 
tuo  tests  from  those  in  the  first  composite  uas  weak,  both  tests 
uere  hypothesized  to  involve  a  form  of  mental  construction  that 
could  be  important  in  the  present  experiment.  A  scatterplot  of 
scores  on  the  tuo  spatial  composites  for  the  available  population 
of  males  is  shown  in  Figure  2.2.  Subjects  selected  for  the 
present  experiment  are  represented  by  squares  in  this  figure. 

The  correlation  between  the  tuo  composites  uas  .68. 

The  verbal  composite  uas  formed  by  summing  normalized  scores 
for  an  abbreviated  version  of  the  Terman  Concept  Mastery  Test 
(Terman,  1950;  however,  see  Lohman.  1979b),  Information, 
Comprehension,  Similarities  and  Vocabulary  (all  from  Uechsler. 

1  955  )  . 

A  scatterplot  of  scores  on  the  first  spatial  composite  and 
the  verbal  composite  for  the  158  males  is  shoun  in  Figure  2.3. 
Again,  subjects  participating  in  the  present  experiment  are 
represented  by  squares. 

As  can  be  seen  in  Figures  2.2  and  2.3,  the  subjects  who 
participated  in  this  experiment  uere  quite  representative  of  the 
reference  population  on  both  spatial  ability  and  verbal  ability. 

All  subjects  uere  paid  for  their  participation.  One  subject 


completed  only  the  first  session  of  the  experiment  and  had  to  be 
replaced.  His  scores  are  represented  by  a  triangle  in  Figures 
2 . 2  and  2.3. 


Trials  differed  in  the  number,  variety  and  magnitude  of 
transformations  required,  the  type  of  test  probe,  and  the 
complexity  of  the  given  or  constructed  stimulus.  Ignoring  for 
the  moment  type  of  test  probe,  stimulus  complexity  and  product 
complexity,  the  major  variants  of  trial  types  are  shown  in  Figure 
2.4.  For  purposes  of  illustration,  all  test  stimuli  are  correct. 

As  is  shown  in  Figure  2.4,  from  four  to  six  frames  were 
presented  on  each  trial.  The  first  and  last  frames  were  always 
"ready"  and  "confidence"  respectively.  The  first  stimulus  was 
presented  on  the  second  frame,  and  the  test  stimulus  or  "draw" 
frame  immediately  preceded  the  "confidence"  frame. 

Column  one  in  Figure  2.4  shows  the  sequence  of  events  when 
no  rotation  or  construction  were  required:  first  the  "ready" 
frame,  next  the  presentation  of  the  base  stimulus  then  the  test 
probe,  and  finally  the  confidence  frame. 

Columns  two  and  three  in  Figure  2.4  show  the  sequence  of 
frames  for  90  and  180  degree  rotations.  Type  one  addition  is 
shown  in  columns  four  and  five,  and  type  two  addition  in  columns 
six  and  seven.  Note  that  when  two  stimuli  are  to  be  added  on  the 
second  frame,  the  stimulus  on  the  left  must  be  added  first.  Also 
note  that  the  " ♦ "  sign  indicates  the  location  of  the  previous 
stimulus.  Two  examples  of  the  four  possible  combinations  of  both 
construction  and  rotation  are  shown  in  columns  eight  and  nine. 
Finally,  column  ten  presents  an  example  of  a  draw  trial.  Draw 
trials  were  the  same  as  any  other  trial  except  that  an 
instruction  to  draw  the  resulting  figure  replaced  the  test  probe. 

g.an&..t.t.ujc.t„igp  al  stimuli 

Seven  sets  of  75  stimuli  were  obtained  by  randomly  selecting 
and  then  plotting  three  to  eight  pairs  of  points  in  each  of  seven 


Figure  2.4  Examples  of  the  major  trial  types  in  the  experimental  task. 


different  3x3  to  8x8  grids.  The  points  were  then  connected  using 
the  Method  1  of  Attneave  and  Arnoult  (1956).  An  informal  pilot 
study  with  these  stimuli  revealed  that  most  uere  far  too 
difficult  to  remember  or  manipulate,  especially  when  attempting 
to  combine  them  mentally.  In  particular,  irregular  forms  uere 
much  more  difficult  than  regular  or  symmetric  forms. 

Accordingly,  irregular  stimuli  uere  "normalized”  by  making  almost 
parallel  lines,  parallel,  nearly  symmetric  lines,  symmetric, 
angles  and  lines  that  uere  nearly  equal,  equal,  and  angles  that 
uere  nearly  90  degrees,  right  angles.  Four  to  eight  point 
stimuli  uithout  at  least  one  right  angle,  or  tuo  parallel  or 
symmetric  lines  uere  then  eliminated.  Each  figure  uas  then 
oriented  so  that  either  one  line  or  the  major  axis  uas  vertical 
or  horizontal.  Examples  of  stimuli  from  each  of  the  three  levels 
of  complexity  are  shcun  in  Figure  2.5.  Since  all  three  point 
stimuli  are  triangles,  stimulus  complexity  uas  necessarily 
confounded  uith  stimulus  familiarity  (or  uniqueness)  in  the 
design.  Every  effort  uas  made  to  avoid  using  the  same  stimulus 
more  than  once.  Thus,  even  though  all  three  point  stimuli  uere 
triangles,  there  are  an  infinite  number  of  different  triangles. 
Some  repetition  of  forms  uas  unavoidable,  particularly  on  those 
items  where  low  complexity  stimuli  uere  combined  to  form  a  j 
complexity  product.  Incorrect  test  stimuli  uere  formed  by 
reflecting  or  incorrectly  rotating  either  the  entire  stimulus  or 
one  of  its  component  stimuli. 

Const  ruct  i  on  oj.  Construction  Items 

Construction  items  uere  developed  by  choosing  a  target 
stimulus  of  the  required  complexity  (lou  or  high),  and  then 
cutting  it  into  the  appropriate  number  of  pieces  of  average 
complexity  equal  to  that  required  by  the  cell  (lou,  medium  or 
high).  Lou,  medium,  or  high  complexity  stimuli  could  be  combined 
to  form  either  a  lou  or  high  complexity  product.  Note  that  the 
level  of  stimulus  complexity  uas  defined  as  the  average  number  of 
points  on  the  tuo  or  three  to-be-combined  stimuli. 
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Apparatus 

All  stimuli  were  photographed  on  Kodak  ASA  40  Super  8  film 
for  presentation  on  a  single  frame  Super  8  projector.  Stimuli 
appeared  as  black  silhouettes  on  a  white  background.  Rotation 
arrows  and  construction  "  +  "  signs  were  also  black  on  a  white 
background.  Test  stimuli  and  the  "draw"  instruction  were 
silhouetted  on  a  yellow  background.  The  background  on  the  ready 
and  confidence  frames  was  red.  The  film  was  projected  on  a  21  x 
28  cm  rear  screen.  Average  stimulus  figure  size  within  this 
field  was  approximately  8  cm.  The  distance  between  subject  and 
screen  uas  approximately  60  cm.  Subjects  controlled  the 
presentation  of  stimuli  by  pressing  either  the  hand  held  "true" 
and  "false"  buttons,  or  one  of  the  four  confidence  buttons 
mounted  on  a  small  keyboard  in  front  of  the  subject. 

Pressing  any  one  of  these  buttons  would  cause  the  projector 
to  advance  to  the  next  frame.  A  South  West  Technical  Products 
6800  microprocessor  recorded  which  key  had  been  pressed,  the 
elapsed  time  on  the  frame  to  the  nearest  hundreth  of  a  second, 
and  then  advanced  the  projector  to  the  next  frame.  The  change 
from  one  frame  to  the  next  was  virtually  instantaneous. 

Procedure 

Each  subject  participated  in  two  sessions,  each  lasting 
approximately  two  hours.  In  the  first  session,  subjects  were 
given  a  general  orientation  to  the  experiment.  The  four 
reference  tests  were  then  administered  in  the  following  order: 
Cards  (four  minutes/part),  Copying  (three  minutes/part )  ,  Figures 
(eight  minutes  total),  and  Memory  for  Designs.  The  15  stimuli  in 
the  Memory  for  Designs  test  were  drawn  on  three  by  five  inch 
cards  and  presented  individually  for  five  seconds.  Subjects  then 
drew  each  design  after  it  was  removed.  They  were  told  that  the 
designs  would  be  scored  both  for  accuracy  and  correct 
proportions . 

The  60  simple  reaction  time  control  trials  were  given  next. 
Each  trial  began  with  a  red  background  "ready”  slide  which 
remained  on  the  screen  for  4.6  seconds.  Either  a  yellow  or  white 


32 


slide  then  appeared.  Subjects  were  told  to  answer  the  question 
"Is  it  yellow?"  as  rapidly  as  possible  by  pressing  one  of  the 
hand  held  buttons.  Subjects  pressed  the  "yes"  with  their 
preferred  hand.  Each  subject  was  first  given  10  practice  trials 
and  then  the  60  test  trials. 

After  a  ten  minute  break  It  demonstration  experimental 
trials  were  given.  Subjects  were  told  to  work  as  rapidly  as 
possible  but  to  be  sure  they  had  a  clear  image  of  the  stimulus 
before  advancing  to  the  next  frame.  They  were  told  that  a  yellow 
background  test  probe  could  occur  at  any  point  during  the  item. 
Further,  they  were  told  that  if  they  took  longer  than  one  second 
to  respond  to  the  test  probe  they  would  not  receive  credit  for 
the  item. 

The  demonstration  was  followed  by  nine  practice  items.  On 
both  practice  and  demonstration,  questions,  important 
distinctions  and  errors  were  explained  by  having  the  subject 
physically  combine  and  rotate  duplicate  cardboard  stimuli. 

Finally  the  first  block  of  experimental  trials  was 
presented.  Subjects  were  asked  to  rate  their  confidence  in  each 
answer  on  a  four  point  scale  (zero  =  guessing;  three  =  certain). 
They  were  asked  to  tell  the  experimenter  if  they  accidentally 
pressed  "yes"  when  they  meant  "no"  or  vise  versa,  and  to  press 
"no"  if  they  could  not  perform  a  given  construction  or  rotation. 
Finally,  they  were  told  that  their  drawings  would  be  scored  for 
both  accuracy  and  correct  proportions. 

In  the  second  session,  subjects  uere  first  given  ten 
practice  trials  from  the  first  block,  and  then  blocks  two.  three 
and  four  with  a  10  minute  break  between  blocks.  At  the 
conclusion  of  the  experiment  subjects  were  asked  to  explain  how 
they  were  solving  the  items.  They  were  then  thanked  and  paid 
three  dollars  per  hour  for  their  participation. 
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C-H.AP.T.E&  LLL 

RESULTS  MO  DISCUSSION 


Qverv 1 ew 

This  chapter  reports  the  major  results  of  the  experiment. 

It  is  divided  into  six  sections.  The  first  section  reports 
preliminary  analyses  on  the  formation  of  the  various  dependent 
measures,  removal  or  replacement  of  outliers,  computation  of 
individual  guessing  scores,  and  the  formation  of  a  new  index  of 
cor rec  tness . 

The  second  section  reports  a  series  of  regression  analyses 
on  the  major  speed-level  hypothesis.  These  analyses  ignore  the 
design  facets  and  examine  the  relationships  between  speed  and 
level  using  a  psychometric  index  of  item  difficulty. 

The  third  section  reports  the  separate  means  analyses  and 
correlational  analyses  on  total  time,  correctness,  and 
confidence.  Here,  the  effects  of  the  design  facets  on  latency 
and  correctness  are  examined  in  greater  detail.  The  effect  of 
practice  on  latency  and  correctness  is  first  examined  briefly. 
Then,  the  problem  of  reaction  times  for  incorrect  items  is 
discussed  and  analyses  on  correct  and  all  reaction  times  are 
reviewed.  Separate  analyses  on  total  time  and  correctness  are 
then  reported  in  some  detail.  The  interpretation  of  correlations 
between  individual  contrast  scores  and  reference  tests  is 
discussed,  and  the  contrast  correlations  for  the  separate 
analyses  presented.  The  results  of  the  total  time  and 
correctness  analyses  are  then  compared. 

In  the  fourth  section,  joint  analyses  on  total  time  and 
correctness  are  presented  and  compared  with  the  separate 
analyses.  It  is  argued  that  the  joint  indices  provide  unique 
insights  for  a  means  analysis,  but  may  distort  the  correlational 
analyses . 

In  the  fifth  section,  separate  analyses  on  the  encoding, 
construction,  rotation,  and  match  component  times  are  reported. 
The  relationships  between  component  times  and  scores,  and 
specific  reference  factors  such  as  Closure  Speed  and  Perceptual 
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Speed  are  also  discussed. 

In  the  sixth  and  final  section,  the  major  results  of  the 
total  time  and  correctness  analyses  are  compared  with  the  results 
of  the  component  time  and  correctness  analyses. 

I •  ExjtLi mnaa'.  An 

Dependent  treasures 

Three  types  of  scores  were  obtained  on  each  trial:  time, 

correctness  and  confidence.  Time  on  each  frame  was  recorded  to 
the  nearest  hundredth  of  a  second.  Total  trial  time  uas 
determined  by  summing  the  component  times,  excluding,  of  course, 
the  "ready”  and  "confidence"  frame  times. 

Two  correctness  measures  were  obtained.  The  first 
represented  the  correctness  of  the  subject's  response  to  the  test 
(match)  stimulus.  The  second  measure  uas  at  the  component  level. 
Subjects  uere  requested  to  press  the  "no"  button  if  they  were 
unable  to  perform  a  particular  component  operation. 

At  the  conclusion  of  the  item,  subjects  rated  their 
confidence  in  their  response  to  the  test  stimulus  on  a  zero 
(guessing)  to  three  (certain)  point  scale. 


Outliers  uere  identified  by  examining  scatterplots  of  total 
and  component  times  versus  composite  design  vector  indices. 

This  permitted  an  examination  of  the  distribution  of  scores 
uithin  each  cell  of  the  design.  For  example,  the  abcissa  in  the 
tota  tine  scatterplots  represented  a  linear  ordering  of  cells 
according  to  construction,  rotation,  stimulus  complexity,  and 
product  complexity. 

These  scatterplots  revealed  that  an  arbitrary  statistical 
rule  lor  defining  outliers  uould  be  unuorkable.  Times  that  uere 
extremely  deviant  for  the  simplest  item  types  uere  not 
necessarily  extreme  uithin  the  total  distribution.  Only  .3  to  .7 
percent  of  the  scores  uere  identified  as  outliers  by  this  method. 
These  scores  uere  either  eliminated,  or  replaced  by  the  next 
highest  time  in  the  cell,  depending  on  the  analysis. 
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£&SLP.Ut a U.gn  2±  Individual  guessing  Scores 

On  any  rating  scale,  some  subjects  utilise  the  entire  scale, 
uhile  others  concentrate  on  one  end  of  the  scale.  Thus,  a  new 
variable  called  GUESS  was  calculated  for  each  subject  by 
computing  the  probability  of  a  correct  response  at  each  level  of 
reported  confidence.  Guess  was  then  defined  as  the  level  of 
confidence  where  the  probability  of  a  correct  response  was  not 
greater  than  .5  at  an  alpha  of  .10. 

A  n.O.rje  SAcinqgn.t  Definition  Correctness 

Component  times  become  un i n t er p r e t ab 1 e  if  subjects  do  not 
perform  the  operations  indicated  on  a  given  frame  before 
advancing  to  the  next  frame.  The  random  presentation  of  items, 
the  unpredictabi 1 i ty  of  the  test  probe,  the  draw  trials,  and  the 
instruction  to  respond  to  the  test  probe  within  one  second  were 
all  designed  to  minimize  this  problem. 

A  number  of  different  factors,  including  failure  to  perform 
component  operations  when  requested,  may  result  in  an  unusually 
long  discrimination  reaction  time.  The  distribution  of  these 
"match"  times  was  produced  for  each  subject.  All  showed  some 
degree  of  bimodality.  The  lowest  point  between  the  two  peaks  was 
then  used  to  define  a  new  variable  called  THATCHN  (N  for  Node) 
for  each  subject.  These  node  times  ranged  from  1.6  to  3.2 
seconds,  with  a  median  of  2.0  seconds.  An  example  of  the 
distribution  of  match  times  and  the  location  of  TMATCHN  for  one 
subject  i3  shown  in  Figure  3.1. 

These  new  indices  were  used  to  define  an  additional 
criterion  of  item  correctness  called  SCORE3 .  An  item  was  counted 
correct  only  if  it  was  correct  (i.e.  SCORE  =  1),  if  confidence 
was  greater  than  GUESS,  if  there  were  no  "no”  responses  to 
component  operations,  and  if  match  time  was  less  than  TflATCHN. 

11.  B.g,g.fgs?  i<?n  Analyses 

The  relationships  between  latency  and  correctness  may  be 
examined  in  two  ways.  The  first  ignores  the  design  facets,  is 
more  atheoretic.  briefer  and  resembles  the  traditional 
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psychometric  type  of  analysis.  The  second  examines  the 
relationships  between  latency  and  correctness  in  the  light  of  the 
design  facets,  assumes  that  these  facets  affect  differential 
changes  in  performance,  is  more  complex,  but  also  more 
enlightening.  This  section  reports  four  analyses  of  the  first 
type.  Analyses  of  the  second  type  constitute  the  remainder  of 
the  chapter . 

Cana  t  C V g  t  Validity  a_i  l_h_e  key e  1  Scores 

Before  examining  the  speed-level  hypothesis,  it  is  first 
necessary  to  establish  the  construct  validity  of  the  level  scor<-. 
Correlations  between  total  SCORE,  total  SCORE  3 ,  total  correct 
Draw  trials,  and  the  major  reference  tests  are  shown  in  Table 
3.1.  Total  SCORE  correlated  .75  with  Paper  Folding  and  .66  with 
Memory  for  D es i g ns -Qua  1 i t y .  The  Draw  total  correlated  .81  with 
Paper  Folding,  .77  with  Surface  Development,  and  .73  with  Memory 
for  Des i gns -Qua  1 i t y .  Further,  Draw  and  total  correct 
i n t erco r r e 1  a t ed  .82.  Clearly,  individual  differences  in  level  on 
this  task  were  congruent  with  individual  differences  in  spatial 
ability  on  the  reference  tests. 

All  three  variables  had  their  highest  correlations  with 
spatial  tests.  With  two  notable  exceptions,  the  spatial  tests 
correlations  for  SCORE  and  SC0RE3  were  virtually  identical. 

SCORE  correlated  higher  with  Paper  Folding,  while  SCO R E 3  had  the 
higher  correlation  with  Figures.  The  crucial  difference  appeared 
in  the  correlations  with  verbal  tests:  SCORE  had  uniformly 
higher  correlations  with  verbal  tests  than  SCORE3 . 

To  clarify  these  relationships,  two  clusters  of  reference 
tests  were  formed.  The  spatial  cluster  was  composed  of  Paper 
Folding,  Form  Board,  Surface  Development,  Cards,  Figures  and 
Memory  for  Des igns -Qua  1 i t y .  The  verbal  cluster  contained  MAIS 
Information,  Comprehension.  Similarities.  Vocabulary,  and  Word 
Beginnings  and  Endings.  SCORE  and  SCO R E 3  were  then  correlated 
with  the  sum  and  the  difference  of  these  two  clusters.  The 
verbal  plus  spatial  composite  represents  general  mental  ability 
(in  fact,  it  correlated  .99  with  UAIS  Full  Scale  Score),  while 


Table  3.1 

Correlations  between  Total  SCORE,  Total  SC0TE3,  Total 
Correct  Draw  Trials,  and  Reference  Tests  (N-30) 


Reference 

Test 

Total 

SCORE 

Total 

SC0RE3 

Total 

Draw 

Paper  Folding 

75 

57 

81 

Form  Board 

48 

45 

58 

Surface  Development 

66 

61 

77 

Copying 

42 

32 

57 

Cards 

49 

41 

50 

Figures 

45 

54 

46 

Memory  for  Designs 

39 

41 

53 

Memory  for  Designs  -  Quality 

66 

63 

73 

Visual  Number  Span 

40 

42 

43 

Identical  Pictures 

39 

26 

50 

Word  Transformations 

36 

25 

43 

Camoflauged  Words 

40 

04 

48 

Hidden  Figures 

53 

30 

52 

Word  Beginnings  and  Endings 

41 

21 

52 

Necessary  Arithmetic  Operations 

47 

21 

52 

Letter  Series 

67 

41 

55 

Terman  Concept  Mastery 

51 

27 

64 

Raven  Progressive  Matrices 

45 

IS 

49 

WAIS  Information 

48 

26 

54 

Comprehension 

33 

25 

55 

Similarities 

46 

38 

5b 

Digit  Span 

51 

44 

36 

Vocabulary 

50 

24 

61 

Digit  Symbol 

27 

15 

47 

Block  Design 

48 

23 

50 

Picture  Arrangement 

35 

41 

42 

Object  Assembly 

28 

29 

51 

Verbal  Scale  Score 

53 

34 

64 

Performance  Scale  Score 

52 

39 

69 

Full  Scale  Score 

60 

41 

76 

Note.  Decimals  omitted. 
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the  spatial  minus  verbal  composite  represents  the  s pa t 1  a  1  - v e r ba 1 
distinction.  If  SCOP.  E  3  is  a  "cleaner"  spatial  score,  it  should 
correlate  higher  uith  the  spatial  end  of  the  s pa t i a  1 -verba  1 
composite  than  SCORE.  The  correlations  with  the  sura  composite 
were  .73  and  .64,  and  with  difference  composite  .29  and  .42  for 
SCORE  and  SC0RE3,  respectively.  Thus,  SC0RE3  is  a  cleaner 
measure  of  spatial  ability  than  SCORE. 

Sp.gert-I.gvsl  Sam  La  U  sms. 

Items  were  first  rank  ordered  according  to  percent  of  the 
sample  failing  the  item  by  SC0RE3.  Total  item  time  uas  then 
regressed  on  percent  failing,  separately  for  each  subject.  The 
analysis  uas  then  repeated  using  only  those  times  for  items 
correct  by  SC0RE3.  Since  items  uere  ordered  on  a  zero  to  one 
scale,  the  intercept  provided  the  best  estimate  of  speed  of 
solving  the  simplest  item  types.  Predicted  speed  of  solving  the 
most  difficult  items  uas  estimated  by  simply  summing  the  slope 
and  intercept.  Correlations  betueen  intercepts,  slopes, 
predicted  times  at  maximum  difficulty  for  the  tuo  analyses,  and 
reference  spatial  tests  are  shoun  in  Table  3.2.  Table  3.3 
contains  a  similar  set  of  correlations,  this  time  excluding  the 
scores  for  one  subject  uho  had  an  extremely  steep  slope. 

The  most  important  correlations  are  those  betueen  the  three 
variables  that  represent  level  on  this  task  (SCORE.  SC0RE3 ,  Orau) 
and  the  intercepts  and  predicted  times  at  maximum  difficulty.  In 
all  four  analyses,  the  intercepts  uere  independent  of  the  level 
scores.  On  the  other  hand,  predicted  times  at  maximum  difficulty 
uere  highly  correlated  uith  total  SCORE  and  total  Drau,  but 
unrelated  to  total  SC0RE3 . 

Examination  of  the  correlations  uith  reference  tests  explain 
this  discrepancy  betueen  the  SCORE  and  SC0RE3  correlations  uith 
predicted  times  for  difficult  items.  In  all  four  analyses,  the 
intercept  correlated  highest  uith  a  strategy  index,  number  of 
pencil  marks  each  subject  made  on  the  Form  Board  test.  The 
correlations  uere  strongest  in  the  analyses  on  correct  times. 

When  all  times  uere  included,  the  correlation  uith  Closure  Speed 


Table  3.2 

Correlations  between  Slopes,  Intercepts,  Predicted  Times 
at  Maximum  Difficulty,  and  Reference  Tests  (N-30) 


Reference  Test 

All 

Times 

Correct 

Times 

Only 

Intercept 

Slope 

A 

T 

Intercept 

Slope 

A 

T 

Paper  Folding 

40 

35 

Form  Board 

-33 

Form  Board  -  Marks 

41 

54 

Surface  Development 

-38 

32 

Copying 

-36 

Cards 

Cards  -  Wrong 

49 

45 

59 

53 

Figures 

Closure  Speed 

-45 

Visual  Number  Span 

34 

39 

33 

38 

Identical  Pictures 

Object  Assembly 

-40 

35 

-31 

Memory  for  Designs-  Quality 

Word  Beginnings  4  Endings 

47 

40 

36 

32 

Necessary  Arith.  Operations 

44 

39 

40 

37 

Terman  Concept  Mastery 

54 

53 

50 

49 

Information 

46 

44 

44 

43 

Comprehension 

44 

36 

42 

35 

Similarities 

-36 

38 

-30 

Digit  Span 

46 

39 

32 

Vocabulary 

54 

49 

47 

44 

Verbal  Scale  Score 

58 

50 

50 

44 

Performance  Scale  Score 

-33 

Full  Scale  Score 

-32 

52 

42 

40 

34 

Draw  Total 

58 

53 

45 

44 

Total  SCORE 

56 

56 

42 

46 

Total  SC0RE3 


Note.  Decimals  and  correlations  less  than  .30  omitted. 
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Table  3.3 

Correlations  between  Slopes,  Intercepts,  Predicted  Times 
at  Maximum  Difficulty,  and  Reference  Tests  (N*29) 


Reference  Test 

All 

Times 

Correct 

Times 

Only 

Intercept 

Slope 

A 

T 

Intercept 

Slope 

T 

Paper  Folding 

53 

46 

42 

37 

Form  Board 

-34 

36 

Form  Board  -  Marks 

41 

37 

53 

30 

Surface  Development 

-38 

48 

32 

33 

Copying 

-38 

Cards 

Figures 

Closure  Speed 

-47 

-34 

Visual  Number  Span 

39 

44 

44 

47 

Identical  Pictures 

40 

32 

Hidden  Figures 

50 

50 

40 

41 

Word  Beginnings  &  Endings 

48 

38 

35 

Necessary  Arith.  Operations 

44 

36 

42 

35 

Letter  Series 

43 

47 

41 

45 

Terman  Concept  Mascery 

52 

48 

49 

46 

WAIS  Information 

43 

41 

46 

42 

Comprehension 

3.0 

Similarities 

-36 

37 

Digit  Span 

46 

37 

Object  Assembly 

-40 

Verbal  Score 

55 

43 

46 

36 

Performance  Score 

-33 

37 

Memory  for  Designs-Quality 

61 

56 

59 

54 

Draw  Total  Correct 

70 

61 

62 

55 

Total  SCORE 

69 

66 

59 

58 

Total  SC0RE3 

40 

33 

32 

Note.  Decimals  and  correlations  less  than  .30  omitted. 


became  significant.  When  the  outlier  was  removed  (Table  3. 3). 
correlations  between  the  intercept  and  other  spatial  tests 
surfaced,  although  none  reached  significance. 

The  slope,  on  the  other  hand,  had  its  highest  correlations 
with  verbal  tests  in  the  full  sample  analyses  (Table  3.2).  Here, 
uhen  all  times  were  included,  slope  correlated  highest  with  UAIS 
Verbal  Scale  Score.  Terman  Concept  Mastery.  UAIS  Vocabulary,  and 
number  of  errors  on  the  Cards  test.  Later  results  will  show  that 
Cards  Urong  is  actually  an  index  of  verbal  strategy,  not  spatial 
ability.  Uhen  only  correct  times  were  included  in  the  analysis, 
slope  correlated  exclusively  with  verbal  tests  and  Cards  Urong. 

Uhen  the  outlier  was  removed,  the  correlations  presented 
more  of  a  mixed  bag.  Here  (see  Table  3.3),  slope  correlated 
highest  with  Memory  for  Designs  -  Quality.  UAIS  Verbal  Scale 
Score.  Terman  Concept  Mastery,  and  Hidden  Figures.  However,  on 
balance,  the  correlations  were  still  stronger  with  verbal  tests 
than  spatial  tests. 

In  sum,  the  major  hypothesis  of  this  investigation  is 
supported  by  these  results,  but  with  an  unexpected  twist.  Speed 
of  solving  simple  items  was  unrelated  to  level  on  the  task. 

Speed  of  so’lving  difficult  items  was  highly  correlated  with  two 
of  the  level  scores.  However,  it  was  expected  that  these 
predicted  times  at  maximum  difficulty  would  be  negatively 
correlated  with  spatial  reference  tests.  Instead,  predicted 
times  for  difficult  items  were  positively  correlated  with  verbal 
ref  e  rence  tests. 

Subsequent  analyses  on  correctness  and  latency  will  clarify 
these  relationships  between  verbal  ability,  spatial  ability,  and 
speed  and  level  scores  on  the  experimental  task.  This  is 
accomplished  by  examining  the  structure  of  cell  means,  through 
analysis  of  variance,  and  the  relationships  between  individual 
differences  in  patterns  of  scores  over  design  facets  and 
reference  tests,  through  contrast  and  cell  total  correlations. 
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111  •  Lfi.'uxai®  ft.aaly.?g?  an  Total  Time.  Correctness  and  Confidence . 
Llms.1i.zjz  Sl.teat.s 

For  the  group  data,  there  was  a  significant  decrease  in 
total  time  over  the  four  blocks.  Mean  times  averaged  over 
subjects  and  items  within  a  block  were  19.46.  16.18.  14.92.  and 

13.06  sec  for  blocks  one  through  four,  respectively.  An  analysis 
of  variance  indicated  that  while  the  linear  term  captured  most  of 
the  variance  (F  ~  255.51)  the  quadratic  and  cubic  terms  were  also 
significant  (F  =  6.09,  p  =  .0138  and  F  =  4.16.  p  =  .042, 
respectively) . 

At  the  individual  level,  nine  subjects  showed  no  change  or 
minimal  change  in  RT  over  blocks,  thirteen  showed  a  definite 
decline  in  RT .  and  eight  had  other  patterns  (such  as  a  large  drop 
between  block  one  and  two,  but  steady  thereafter,  or  a  gradual 
increase  in  RT  over  blocks). 

Subjects  also  tended  to  make  fewer  errors  with  practice. 

The  percentage  of  correct  responses  was  .78,  .80,  .83  and  .83  for 

blocks  one  through  four  respectively.  The  corresponding  values 
for  SCORE 3  were  .60,  .67  ,  .72.  and  .72. 

£1  1st  Incorrect  items 

When  a  subject  fails  to  solve  an  item,  the  reaction  time  for 
that  item  is  ambiguous.  While  some  investigators  regularly 
include  both  correct  and  incorrect  reaction  times  in  their 
analyses  (e.g.,  Sternberg.  1977),  the  usual  procedure  is  to 
exclude  times  from  incorrect  items  (Pachella.  1974).  However, 
this  experiment  differs  from  the  classical  RT  study  in  two 
important  ways: 

1.  Subjects  were  deliberately  selected  to 
represent  a  wide  range  of  individual  differences  in 
spatial  ability.  Using  only  reaction  times  for  correct 
items  would  bias  the  results  by  weighing  more  heavily 
the  performance  of  the  more  able  subjects. 

2.  Similarly,  items  were  constructed  in  order  to 
represent  a  wide  range  of  item  difficulty  or 
complexity.  An  analysis  of  correct  reaction  times 


44 


would  bo  buied,  (jiving  more  weight  to  the  ensier 
items  . 

In  any  case  the  issue  of  whether  to  include  only  the  "clean" 
correct  RT’s  or  all  RT’s  in  an  analysis  is  an  empirical  matter. 

As  a  preliminary  check,  an  analysis  ol  variance  was  performed  on 
all  reaction  times  (except  30  outliers,  which  were  excluded)  and 
on  the  subset  of  those  times  where  the  item  was  correct  and 
confidence  was  greater  than  GUESS.  The  results  are  reported 
elsewhere  (see  Lehman,  1979b).  Note  that  these  are  not  repeated 
measures  analyses  of  variance.  Computer  limitations  precluded 
doing  this  type  of  analysis  on  the  incomplete  data  matrix  defined 
by  the  second  analysis.  Therefore,  all  subject  factor 
interactions  wore  included  in  the  error  term.  Both  analyses 
followed  a  regression  model  in  which  main  effects  were  entered 
first,  two  way  interactions  second,  etc. 

Including  all  times  in  the  analysis  added  some  noise.  The 
squared  multiple  correlations  were  .352  and  .919  for  the  complete 
and  restricted  data  sets,  respectively.  A  few  interactions  were 
significant  in  one  analysis  and  nonsignificant  in  the  other,  but 
on  the  whole,  the  results  were  quite  similar.  This  does  not  moan 
that  there  are  not  important  psychological  differences  between 
correct  and  incorrect  reaction  times.  Rather,  it  merely 
indicates  that  including  all  times  in  the  analysis  of  variance 
did  not  seriously  alter  the  results. 

AiLHlxa.i.1  an  La.U.1  Ixmn 

BaatuL  JLnjLixrLa. 

A  repeated  measures  analysis  of  variance  on  total  time  is 
shown  in  Table  3.9.  Times  for  both  correct  and  incorrect  items 
were  included  in  the  analysis.  Thirty  extreme  observations  were 
replaced  by  the  next  highest  within  cell  time. 

All  main  effects  were  in  the  expected  direction,  except 
match,  which  was  not  significant.  Construction,  rotation,  and 
stimulus  complexity  all  had  strong  linear  components  and  weak, 
but  significant  quadratic  components.  These  main  effects  are 
plotted  in  column  one  of  Figure  3.5.  below.  Note  also  that  there 
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Table  3.4 


Analysis  of  Variance  on  Total  Time 


Source 

df 

MS 

F 

Source 

df 

MS 

F 

Mean 

l 

1624939.49 

263.10* 

N 

29 

6176.11 

- 

Construct  ion 

(2) 

NC 

(58) 

Linear 

l 

131612.26 

127.81* 

Linear 

29 

1020.  •(, 

33.72* 

Quadrat ic 

1 

2089.17 

64.61* 

Quadrat ic 

29 

32.  34 

1 . 06 

Rotation 

(2) 

NR 

(581 

Linear 

l 

33597.19 

121.55* 

Linear 

29 

276.40 

9. OS* 

Quadrat ic 

l 

2295.83 

35.29* 

Quadrat ic 

29 

65.05 

2.13* 

Stimulus  Complexity 

(2) 

NS 

(58) 

Linear 

1 

22628. 14 

54.38* 

Linear 

29 

416.  12 

13.63* 

Quadratic 

l 

2<>8.99 

6.01 

Quadrat ic 

29 

49.78 

1.63 

Match 

l 

23.66 

.69 

NM 

29 

32.28 

1.  12 

Type  of  Addition 

l 

2787.89 

47.45* 

NT 

20 

58.75 

l .  02* 

Product  Complexity 

l 

16867.77 

38.58* 

NP 

29 

437.  18 

U.  32* 

CR 

4 

359.27 

11. tl* 

NCR 

116 

32.34 

1.06 

CS 

4 

1009.25 

22.55* 

NCS 

U  6 

44.75 

1.47* 

CM 

2 

15.62 

.79 

NCM 

58 

19.76 

.  65 

KT 

l 

3003.60 

38.51* 

NKT 

20 

78.00 

2.55* 

KP 

l 

4026.37 

31 .07* 

NKF 

20 

120.58 

4.24* 

RS 

4 

1 18.33 

4.01* 

NRS 

l  1 6 

2°.  .«o 

.07 

RM 

dm 

276.42 

5.88* 

NRM 

58 

47.02 

l .  5*** 

RT 

2 

976.06 

16.76* 

NRT 

58 

58.24 

1.91* 

RP 

2 

1006.13 

22. 32* 

NRP 

58 

4  5.08 

1.48 

SM 

'j 

dm 

46.96 

1.48 

NSM 

58 

31.75 

1.04 

ST 

2 

1175.67 

18.06* 

NST 

58 

65.09 

2.13* 

SP 

2 

1067. 90 

10.33* 

NSP 

58 

103.38 

3.39* 

MT 

i 

632.59 

13.23* 

NKT 

2° 

47.83 

1.57 

MP 

i 

1195.24 

1 1 . 80* 

NMP 

29 

101.26 

3.  32* 

TP 

i 

2834.35 

26.24* 

NTP 

29 

108.02 

3.  54* 

CRS 

s 

124.37 

2.87 

NCRS 

232 

43.20 

1.42* 

CRM 

4 

378.73 

8 . 69* 

NORM 

116 

43.58 

1.43* 

CSM 

4 

566.70 

11.91* 

NCSM 

116 

47.50 

1.56* 

Table  3.4  (continued) 


Source 

df 

MS 

F 

Source 

df 

MS 

F 

KRSMT 

4 

53.22 

.90 

NKRSMT 

116 

59.22 

1.94* 

KRSMP 

4 

396.04 

5.78* 

NKRSMP 

116 

63.48 

2.24* 

KRSTP 

4 

433.76 

8.55* 

NKRSTP 

116 

50.71 

1.66* 

KRMTP 

2 

447.89 

7.33* 

NKRMTP 

58 

61.07 

2.00* 

KSMTP 

2 

482.70 

3.63 

NKSMTP 

58 

132.86 

4.35* 

RSMTP 

4 

796.33 

9.71* 

NRSMTP 

116 

82.02 

2.69* 

KRSMTP 

4 

1237.93 

11.33* 

NKRSMTP 

Residual 

116 

1620 

109.22 

30.54 

3.58* 

were  many  large  interactions.  Thus,  a  simple  additive  model  does 
not  lit  the  data  well. 

All  two-way  interactions  except  CM  and  SM  were  significant. 
The  eight  largest  of  these  are  plotted  in  Figure  3.2.  The 
construction  by  rotation  interaction  indicates  that  the  time 
taken  to  rotate  and  match  a  constructed  stimulus  was  slightly 
greater  at  180  degrees  for  two  piece  additions  than  one  piece 
additions. 

Although  the  type  of  addition  main  effect  was  significant, 
the  construction  by  type  (KT)  interaction  reveals  that  left 
additions  took  longer  only  for  two  piece  additions.  Similarly, 
the  type  of  addition  by  product  comlexity  (TP)  interaction 
reveals  that  left  and  right  additions  at  low  product  complexity 
took  the  same  amount  of  time,  while  left  additions  were  much  more 
time  consuming  than  right  additions  at  high  product  complexity. 

The  construction  by  product  complexity  (KP)  interaction 
reflects  the  fact  that  high  product  complexity  additions  were 
relatively  more  time  consuming  for  two  piece  additions  than  one 
piece  additions.  Note  that  in  all  interactions  between 
construction  and  type  of  addition  or  product  complexity, 
construction  is  represented  by  K.  This  is  necessary  in  order  to 
distinguish  between  the  two  level  construction  contrast  between 
two  piece  additions  and  one  piece  additions  (K),  and  the  C  linear 
and  quadratic  contrasts  that  span  the  entire  three  levels  of  the 
construction  facet.  Discussion  of  the  RT  and  RP  interactions  is 
best  postponed  until  after  the  contrast  correlations  are 
presented  and  correctness  and  latency  are  examined  jointly. 


Two  apparent  limitations  of  the  correlations  between  these 
contrast  scores  and  reference  variables  merit  brief  comment. 
First,  correlations  based  on  only  30  cases  are  usually  quite 
unstable.  However,  subjects  were  not  selected  randomly;  rather 
they  were  deliberately  chosen  to  be  a  representative  sample  of 


verbal  and  spatial  ability  in  the  larger  population  (N  *  171  high 
school  and  college  males).  Therefore,  the  correlations  reported 
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here  are  much  more  dependable  than  the  sample  size  suggests. 

Second,  all  contrasts  except  the  N  contrast  are  difference 
scores.  Difference  scores  are  known  to  be  relatively  unreliable 
(Lord.  1963;  Cronbach  and  Furby,  1970).  Five  factors  mitigate 
these  complaints  . 

1.  Each  contrast  score  is  based  on  at  least  144 
observations  and  as  many  as  216  observations  for 
each  subject.  Thus,  the  means  and  differences 
between  them  should  be  quite  stable. 

2.  As  Cronbach  and  Furby  (1970)  point  out.  linked 
errors  can  substantially  enhance  the  reliability  of 
difference  scores.  Since  all  observations  were 
obtained  from  one  task  there  necessarily  are  linked 
errors  in  these  scores.  Hence,  the  reliability  of 
the  difference  scores  is  greater  than  would  be 
estimated  by  conventional  methods  (i.e..  Lord, 

1963  )  . 

3.  All  reliability  coefficients  are  merely  estimates 
of  score  reliability.  Low  estimated  reliability 
may  help  explain  why  a  score  fails  to  correlate 
with  other  variables.  However,  if  the  reliability 
estimates  are  low,  and  the  correlations  are  high, 
then  it  is  the  reliability  estimate  that  needs 
explaining.  Further,  even  within  the  rigid 
assumptions  of  classical  test  theory,  it  is  the 
square  root  of  the  estimated  reliability  that 
defines  the  limit  in  validity.  Thus,  if  the 
reliability  of  a  test  is  only  .25,  the  test  may  (in 
theory)  correlate  .50  with  another  variable. 

4.  It  can  be  shown  (see  Lohman,  1979b)  that  the 
correlation  between  the  difference  score  X-Y  and 
another  variable  Z  will  be  greater  than  the 
difference  between  r(X.Z)  and  r(Y,Z)  if  r(X,Y)  is 
greater  than  .50.  If-r(X.Y)  is  .80,  then  r(X-Y.Z) 

=  1.58  lr(X,2)-  r ( Y  ,  Z  )  I .  If  r(X.Y)*.95,  then  r(X- 
Y.Z)  =  3.16  (r(X.Z)-r(X.Y)  I.  Thus,  when  cell 
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intercorrelations  are  high,  as  in  this  task, 
correlations  between  individual  contrast  scores  and 
reference  tests  may  be  quite  high.  Further,  since 
the  contrast  score  correlation  with  a  reference 
test  will  be  much  larger  than  the  difference 
between  corresponding  correlations  between  each 
cell  and  the  reference  test,  it  is  much  easier  to 
detect  differential  correlations  between 
performance  over  a  task  and  reference  tests  using 
the  contrast  correlations. 

5.  In  most  educational  applications,  the  absolute 

magnitude  of  the  gain  score  is  frequently  about  the 
size  of  the  standard  error  of  measurement  of  the 
raw  scores,  and  much  smaller  than  the  range  of 
individual  differences  in  the  raw  scores.  However, 
th.e  magnitude  of  the  difference  score  over  a 
powerful  facet  in  a  within  subjects  design  may  be 
much  larger  than  the  standard  error  of  the  raw 
scores  and  even  larger  than  the  the  range  of 
individual  differences  in  the  raw  scores. 

Thus,  contrast  scores  may  be  more  dependable  and  more  useful 
than  is  commonly  assumed  in  the  psychometric  literature  on 
difference  scores.  Nevertheless,  there  are  three  important 
limitations  of  contrast  scores  that  must  be  noted: 

1.  Even  though  contrast  scores  may  be  more  dependable  than  the 
psychometric  literature  suggests,  they  are  still  less 
reliable  than  their  corresponding  raw  scores. 

2.  Ceiling  and  floor  effects  in  either  raw  score  can  destroy  the 
validity  of  the  difference  score. 

3.  While  analysis  of  variance  procedures  are  relatively  robust 
with  respect  to  violations  of  the  equal  variance  assumptions, 
correlations  are  not.  Thus,  correlations  between  contrast 
scores  and  other  variables  can  be  seriously  biased  if  the 
within  cell  variances  are  not  a pp roxima t el y  equal.  In 
particular,  such  correlations  will  tend  to  reflect  the  rank 
order  of  raw  scores  in  the  cell  with  the  greater  variance. 
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T.Ql&l  IiP-g  C.3Pt  r  g  S  t  Correlat  i  ons 

Significant  correlations  between  reference  tests  and  the  ten 
total  tine  nain  effect  contrasts  are  summarized  in  Table  3.5. 

M  Contrast .  The  N  contrast  represents  subject  means.  Mean 
total  time  correlated  .49  with  total  SCORE  but  only  .18  with 
total  SC0RE3.  Therefore,  those  who  take  longer  tend  to  get  more 
items  correct.  However,  when  scores  are  corrected  for  guessing 
and  extremely  long  match  times,  mean  time  and  correctness  become 
independent.  The  only  other  significant  H  contrast  correlation 
was  .43  with  Terman  Concept  Mastery.  Correlations  with  other 
verbal  tests  were  positive,  but  not  significant  (i.e.,  .37  with 

WAIS  Information,  and  .35  with  Vocabulary),  while  correlations 
with  spatial  tests  were  all  much  smaller,  and,  of  course,  not 
significant.  Thus,  if  anything,  mean  time  appears  to  be  more 
related  to  verbal  ability  than  spatial  ability. 

Construction  Contrasts.  The  construction  linear  contrast 
represents  the  difference  between  average  time  on  two  piece 
additions  and  no  addition  items.  Scores  on  this  contrast  were 
most  consistently  correlated  with  verbal  tests  and  number  of 
errors  on  the  Cards  test.  The  quadratic  contrast  represents  the 
difference  between  the  average  of  no  additions  plus  two  piece 
additions,  and  one  piece  additions.  High  positive  scores  reflect 
little  change  between  zero  addition  and  one  addition,  and  a  large 
increase  between  one  and  two  additions.  Negative  scores  reflect 
the  reverse  pattern.  The  contrast  correlated  negatively  with 
Visual  Number  Span  (r*-.4S)  and  Hidden  Figures  (r=-.51)  and 
positively  with  Marks  Visual  Imagery  Questionaire  (r=.43). 
Therefore,  those  who  reported  vivid  imagery  were  more  likely  to 
show  the  most  change  between  no  additions  and  one  addition,  while 
those  with  high  scores  on  Hidden  Figures  and  Visual  Number  Span 
showed  larger  increases  between  one  and  two  piece  additions.  The 
most  important  finding  here  was  the  opposition  between  Hidden 
Figures,  which  is  probably  the  most  "analytic"  spatial  test  in 
the  battery,  and  the  reported  vividness  of  visual  imagery.  This 
suggests  that  an  analytic  approach  to  the  construction  task  can 
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Correlations  between  Total  Time  Main  Effect  Contrast  Scores  and  Reference  Tests  <K*3G) 


Note,  Decimals  and  correlations  less  than  .42  omitted 


be  distinguished  from  a  more  holistic  imagery  approach.  The  tuo 
variables  were  associated  with  different  patterns  within  the 
task,  even  though  neither  uas  s ign i f i can t 1 y  related  to  overall 
performance. 

Rotation  Cont  ras  ts .  There  were  only  two  significant 
correlations  between  the  rotation  contrasts  and  reference  tests. 
The  linear  contrast  correlated  .57  with  errors  on  the  Cards  test, 
and  the  quadratic  contrast  correlated  .54  with  UAIS  Picture 
Completion.-  Later  results  suggest  that  errors  on  the  Cards  test 
are  related  to  verbal  strategies.  Thus,  those  who  showed  the 
greatest  time  differences  between  no  rotation  and  180  degree 
rotation  may  have  been  using  nonspatial  strategies  to  rotate  the 
stimuli . 

S t inu l us  Complexity.  Both  the  linear  and  quadratic  stimulus 
complexity  contrasts  correlated  strongly  with  verbal  ability. 

The  linear  contrast  correlated  significantly  with  seven  reference 
tests,  and  only  one  of  these  was  a  spatial  test  (Paper  Folding, 
r=.44).  The  highest  correlation  was  with  UAIS  Verbal  Scale  Score 
( r= . 60 ) . 

The  stimulus  complexity  quadratic  contrast  correlated 
significantly  with  fouteen  reference  tests.  Ten  of  these  were 
verbal  or  verbal  reasoning  tests,  three  were  perceptual  speed 
tests,  and  one  was  a  spatial  test  (Paper  Folding,  r=.47).  The 
highest  correlations  were  .61  with  UAIS  Verbal  Scale  Score  and 
Digit  Span  Forward.  Thus,  those  who  showed  the  largest  increase 
in  total  time  between  low  and  medium  complexity  stimuli  had  low 
scores  on  these  reference  tests,  while  those  who  showed  the 
greatest  increase  between  medium  and  high  complexity  stimuli 
scored  highest  on  these  reference  tests. 

Ha  tch .  Although  the  match  facet  was  not  significant  in  the 
analysis  of  variance,  it  did  have  four  large  correlations  with 
reference  tests.  Scores  on  this  contrast  represent  the 
difference  between  time  taken  to  accept  a  correct  test  stimulus 
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and  the  time  required  to  reject  an  incorrect  stimulus.  Thus, 
positive  scores  reflect  longer  acceptance  times  and  negative 
scores  longer  rejection  times.  The  contrast  correlated 
negatively  with  Form  Board  and  Picture  Completion,  and  positively 
with  Cards  Wrong  and  Visual  Number  Span.  Therefore,  those  who 
were  faster  accepting  a  correct  probe  scored  higher  on  Form  Board 
and  Picture  Completion,  while  those  who  were  faster  rejecting  an 
incorrect  probe  made  more  errors  on  the  Cards  test  and  scored 
higher  on  the  Visual  Number  Span  test.  Both  of  these  latter 
variables  were  associated  with  verbal  strategies  throughout  the 
analysis.  Form  Board  and  Picture  Completion,  on  the  other  hand, 
are  good  examples  of  spatial  tests  that  place  a  premium  on 
comparing  a  constructed  or  remembered  stimulus  with  a  given 
stimulus.  Therefore,  as  suggested  in  the  comparison  of  SCORE  and 
SC0RE3.  match  times  may  provide  a  useful  index  of  solution 
s  t  r  a  t  eg  y  . 

Type  aJL  Addition .  Individual  scores  on  this  contrast 
correlated  significantly  with  eight  reference  tests.  The  highest 
correlation  was  .62  with  WAIS  Object  Assembly.  Thus,  those  who 
obtained  higher  scores  on  this  test  took  much  longer  on  left 
additions  than  on  right  additions.  Six  correlations  (ranging 
from  r  *  .  k  3  to  .52)  were  with  verbal  tests,  and  one  with  another 
spatial  test  (Paper  Folding,  r=.k8).  Therefore,  excepting  the 
strong  correlation  with  Object  Assembly,  time  once  again  was 
related  to  verbal  ability. 

Product  Complexity.  Contrast  scores  on  this  facet 
correlated  with  both  spatial  tests  and  verbal  tests.  Therefore, 
the  effect  is  most  clearly  associated  with  general  ability.  High 
ability  subjects  took  much  longer  on  high  complexity  products 
than  low  complexity  products.  Further,  this  contrast  was  the 
best  predictor  of  overall  performance  on  this  task.  The 
correlation  with  total  SCORE  was  .65,  and  with  Total  Correct  Draw 
trials  .73. 
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Means  Analysis . 

Analyses  of  variance  for  SCORE  and  SCORE  3  are  shown  in 
Tables  3.6  and  3.7.  Estimated  variance  components  and  percent  of 
variance  associated  with  each  facet  in  the  SCO  R  E  3  analysis  are 
reported  elsewhere  (see  Lohman.  1979b).  The  data  uere  first 
collapsed  over  the  match  and  type  of  addition  facets,  except  at 
level  one  of  construction,  where  match  was  treated  as 
replications.  Each  cell  thus  contained  four  observations.  Cell 
means  were  then  computed  with  two  means  per  cell  at  level  one  of 
the  construction  facet  and  one  mean  at  levels  two  and  three. 
Replications  within  the  level  one  construction  cells  provided  an 
estimate  MS ( e )  used  to  test  the  s u b j ec t - f ac e t  interactions. 

The  two  analyses  produced  quite  different  results.  In 
particular,  the  linear  contrast  for  stimulus  complexity  was 
highly  significant  in  the  SC0RE3  analysis,  but  nonsignificant  in 
the  SCORE  analysis.  Product  complexity  was  also  highly 
significant  in  the  SC0RE3  analysis  and  only  marginally 
significant  in  the  SCORE  analysis.  On  the  other  hand,  the 
stimulus  complexity  by  product  complexity  interaction  was  much 
stronger  in  the  SCORE  analysis. 

The  major  difference  between  these  two  variables  is  that  a 
subject  could  obtain  credit  for  an  item  under  SCORE  when  he 
guessed,  had  little  faith  in  his  answer  or  took  an  inordinately 
long  time  to  accept  or  reject  the  test  stimulus.  SC0RE3 
attempted  to  remedy  these  deficiencies.  Correlations  between 
total  SCORE,  total  SC0RE3.  and  reference  tests  previously 
examined  (see  Table  3.1)  indicated  that  SC0RE3  was  a  cleaner 
measure  of  spatial  ability.  The  analyses  of  variance  on  the  two 
variables  confirm  this.  SCORE3  was  a  better  measure  of  spatial 
ability  than  SCORE  on  two  counts.  Its  internal  validity,  as 
measured  by  congruence  with  the  model,  and  external  validity,  as 
measured  by  differential  correlations  with  reference  tests,  uero 
both  higher. 

Plots  for  the  construction,  rotation,  stimulus  complexity, 
and  product  complexity  SC0RE3  main  effects  are  shown  in  the 
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Table  3.6 

SCORE  Analysis  of  Variance 


Source 


df  MS  F  '  Source  df 


Mean 

Construction 

Linear 

Quadratic 

Rotation 

Linear 

Quadratic 


1  1051.747 


2921.52* 


N 


29 


(2) 


1 

4.896 

102.01* 

1 

2.044 

40.87* 

(2) 

1 

.994 

16.56* 

1 

.081 

2.32 

NC  (58! 

Linear  29 

Quadratic  29 


NR 


(58] 


Linear  29 
Quadratic  29 


Stimulus  Complexity  (2) 


NS 


(58) 


Linear 

1 

.058 

3.63 

Linear 

29 

Quadratic 

1 

.005 

.14 

Quadratic 

29 

Product  Complexity 

1 

.363 

6.85 

NP 

29 

CR 

4 

.166 

5.01* 

NCR 

116 

CS 

4 

.240 

6.73* 

NCS 

116 

KP 

1 

.063 

1.30 

NKP 

29 

RS 

4 

.048 

1.73 

NRS 

116 

RP 

2 

.023 

.72 

NRP 

58 

SP 

2 

.359 

9.32* 

NSP 

58 

CRS 

8 

.086 

2.86* 

NCRS 

232 

KRP 

2 

.029 

.76 

NKRP 

58 

K.SP 

2 

.620 

14.95* 

NKSP 

58 

RSP 

4 

.254 

7.63* 

NRSP 

116 

KRSP 

4 

.204 

5.78* 

NKRSP 

116 

Residual 

270 

p  less  than  .01 


Table  3.7 


SC0RE3  Analysis  of  Variance 


Source 

df 

MS 

F 

Source 

df 

MS 

F 

Mean 

1 

740.133 

1030.82* 

N 

29 

.718 

- 

Construct  ion 

(2) 

NC 

(58) 

Linear 

l 

14.118 

160.43* 

Linear 

29 

.088 

3.34* 

Quadratic 

1 

4.129 

74.44* 

Quadrat i 

c  29 

.057 

2.17* 

Rotation 

(2) 

NR 

(58) 

Linear 

1 

2.827 

27.19* 

Linear 

29 

.  104 

3.95* 

Quadratic 

1 

.039 

.88 

Quadratic  79 

.  044 

1.67 

Stimulus  Complexity 

(2) 

NS 

(58) 

Linear 

l 

1.387 

33.82* 

Linear 

29 

.041 

1.56 

Quadratic 

1 

.000 

.01 

Quadrat i 

c  29 

.024 

.91 

Product  Complexity 

1 

5.703 

64.80* 

NP 

29 

.088 

3.34* 

CR 

4 

.270 

6.35* 

NCR 

116 

.042 

1.62* 

CS 

4 

.284 

7.33* 

NCS 

116 

.039 

1.47* 

KP 

1 

.029 

.49 

NKP 

29 

.059 

2.24* 

RS 

4 

.046 

1.23 

NRS 

116 

.038 

1.43 

RP 

.  198 

3.97 

NRP 

58 

.050 

1.90* 

sr 

2 

.206 

4.13 

NSP 

58 

.050 

1.90* 

CRS 

8 

.135 

3.49 

NCRS 

232 

.039 

1.47* 

KRP 

2 

.100 

2.83 

NKRP 

58 

.036 

1.35 

KSP 

2 

.789 

15.00* 

NKSP 

58 

.053 

2 . 00* 

RSP 

4 

.209 

4.51* 

NRSP 

1 16 

.  046 

1.76* 

RRSP 

4 

.603 

14.79* 

NKRSP 

116 

.041 

1.55* 

Residual 

270 

.026 

*  ■  p  less  than  .01 
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second  column  of  f tgur*  3  .  .  5  (  b  e  1  o  u  )  .  Both  rotation  and  stimulus 

complexity  show  linear  declines  in  correctness  The  important 
contrast  for  construction  uas  between  no  addition  lie,  C  ■  I) 
and  addition  (C  •  2,3). 

Caaixjal  CarcttUAxga.*- 

Correlations  between  SC0RE3  mam  effect  contrast  scores  and 
reference  tests  are  summarised  in  Table  3.8. 

Correlations  between  the  N  contrast  and  reference  tests  were 
previously  discussed  (see  Table  3.1).  To  recapitulate,  mean 
SC0RE3  scores  correlated  significantly  with  ten  spatial  or  Gf 
reference  tests,  and  no  verbal  tests.  Individual  construction 
linear  and  quadratic  contrast  scores  showed  strong  differential 
correlations  with  several  reference  spatial  tests.  The  largest 
differentials  were  with  Form  Board  and  UAIS  Object  Assembly.  The 
linear  contrast  scores  correlated  .53  and  .56  with  Form  Board  and 
Object  Assembly,  while  the  corresponding  quadratic  contrast 
correlations  were  -.52  and  -.51.  Thus,  high  scores  on  these 
tests  are  reflected  in  a  linear  decline  in  correctness  over  the 
construction  facet. 

The  only  significant  correlation  for  the  rotation  contrast 
scores  was  a  correlation  of  .44  between  the  linear  contrast  and 
Figures.  The  correlation  with  Cards  was  similar  but 
nonsignificant  (  r  *  .  3  Q  )  .  The  rotation  quadratic  component 
correlations  were  not  significant. 

While  the  stimulus  complexity  constrasts  had  no  significant 
correlations  with  reference  tests,  the  product  complexity 
contrast  correlated  negatively  with  two  WA1S  verbal  subtests  and 
the  Verbal  Scale  Score  (r»~.43).  Thus,  large  decrements  in 
performance  over  the  product  complexity  facet  are  more  strongly 
associated  with  verbal  ability  than  spatial  ability. 

E.jntl  xsunLflxv;  A_n&Lv3  vs  el  i4<ui.c.e 

Although  the  confidence  data  were  used  primarily  to  help 
"clean  up"  the  correctness  index,  they  are  of  some  interest  in 
their  own  right.  In  general,  the  analysis  of  confidence  mirrored 
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Correlations  between  SC0RE3  Main  Effect  Contrast  Scores  and  Reference  Tests  (N=30) 


Note.  Decimals  and  correlations  less  than  .42  omitted. 


the  analysis  o{  a  composite  index  of  difficulty  (time  minus 
correctness)  that  is  reported  below. 

The  analysis  of  variance  on  these  scores  is  reported  in 
Lohman  ( 1979b) .  Confidence  decreased  significantly  over  all 
facets  except  Match.  For  rotation  and  stimulus  complexity,  the 
decline  was  linear  (the  quadratic  contrasts  were  not 
significant).  For  construction,  the  decline  was  greater  between 
no  addition  and  one  addition  than  between  one  and  two  piece 
additions. 

There  were  a  number  of  significant  interactions.  However, 
all  were  congruent  with  other  analyses,  particularly  ZDIFF  (see 
below)  and  total  time.  Therefore,  they  will  not  be  discussed 
here.  The  important  finding  here  was  that  immediately  obtained 
self  report  surity  ratings  closely  paralleled  the  more 
"objective"  latency  and  correctness  measures.  Thus,  self  reports 
may  have  considerably  more  validity  than  is  commonly  assumed 
(e.g.,  Nisbitt  and  Uilson,  1977),  especially  when  obtained  as  in 
this  experiment. 

SLarypar is<?n  si.  SC. ore 3  and  iqt.ai  Has  Analyses  . 

The  separate  analyses  on  total  time  and  SC0RE3  clearly  reveal 
that  time  and  error  rate  are  not  interchangeable  aspects  of 
performance. 

Since  the  data  were  collapsed  over  match  and  type  of  addition 
for  the  correctness  analyses,  the  analyses  of  variance  on  total 
time  and  SC0RE3  are  not  totally  comparable.  There  were,  however, 
two  important  differences  between  them. 

1.  Error  rate  was  the  same  at  levels  two  and  three  of 
the  construction  facet,  while  total  time  increased 
linearly  over  the  three  levels  of  construction. 

2.  Interactions  were  fewer  and  smaller  in  the  SCORE 3 
analysis  than  in  the  total  time  analysis. 

The  most  striking  differences  between  the  two  analyses, 
however,  were  in  the  correlations  between  individual  contrast 
scores  and  reference  tests.  These  differences  are  summarized 
below  for  each  contrast. 


tL  Contrasts. 


Mean  SC0RE3  correlated  significantly  uith  ten  spatial  or  Gf 
tests.  Mean  time,  on  the  other  hand,  correlated  significantly 
uith  one  verbal  reasoning  test.  Correlations  uith  several  other 
verbal  tests  approached  significance,  uhile  correlations  uith 
spatial  tests  centered  near  zero. 

Cgnstrvc.ticn. 

For  SC0RE3,  a  linear  decline  uas  associated  uith  high  scores 
and  a  quadratic  decline  uith  lou  scores  on  Form  Board  and  Object 
assembly.  For  total  time,  a  linear  increase  uas  associated  uith 
verbal  ability.  The  quadratic  contrast  scores  for  total  time 
correlated  positively  uith  Marks  Vividness  of  Visual  Imagery 
Questionaire  and  negatively  uith  Hidden  Figures. 

Rotation. 

The  linear  decline  in  SC0RE3  over  the  rotation  facet  uas 
correlated  significantly  uith  only  one  reference  test:  Figures. 
The  correlation  uith  the  Cards  test  uas  similar  but  not 
significant.  For  total  time,  the  magnitude  of  the  linear 
increase  uas  correlated  uith  number  of  errors  on  the  Cards  test. 

Stimulus  Complexity. 

Neither  the  linear  nor  quadratic  SCORE 3  stimulus  complexity 
contrasts  correlated  uith  any  of  the  reference  variables.  Both 
total  time  contrasts,  on  the  other  hand,  correlated  strongly  uith 
verbal  tests . 


Erpduqt  S.P..m  Pi  exit/. 

For  SC0RE3,  large  decrements  in  performance  over  this  facet 
uere  associated  uith  verbal  ability.  For  total  time,  large 
increments  uere  associated  uith  general  ability. 

tla.tc.tl  M  IXfie  SA  Addition. 

Since  data  uere  collapsed  over  these  facets  in  the  SC0RE3 
analysis,  only  total  time  contrast  scores  uere  obtained.  For  the 
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match  facet,  faster  responses  to  positive  test  stimuli  were 
associated  with  high  scores  on  Form  Board  and  Picture  Completion. 
Faster  responses  to  negative  test  stimuli  were  associated  with 
more  errors  on  the  Cards  test  and  higher  scores  on  Visual  Number 
Span  . 

Taking  much  longer  on  left  addition  than  right  additions 
correlated  highly  with  Object  Assembly  and  moderately  with 
several  verbal  tests. 

SJ.Ug.Sl3  r.y.  Sd  lil£  Anal  yses  . 

Correlations  between  the  two  N  contrasts  and  the  reference 
tests  address  the  construct  validity  of  the  test  in  global  terms. 
They  both  answer  the  question.  "What,  in  general,  is  measured  by 
this  task?"  For  SC0RE3,  the  answer  is  clearly  "spatial  ability." 
For  total  time,  it  is  a  more  tentative  "verbal  ability,  if 
anything . " 

Given  these  global  assessments,  the  contrast  correlations 
within  each  facet  look  at  the  relationships  between  the  pattern 
of  scores  over  that  facet  and  the  reference  tests. 

In  general,  the  SC0RE3  contrast  correlations  supported  the 
construct  validity  of  the  task,  while  the  total  time  correlations 
revealed  more  about  the  strategies  or  processes  subjects  used  to 
solve  particular  item  types.  These  correlations  have  already 
been  discussed  in  some  detail  for  the  stimulus  complexity, 
product  complexity,  match  and  type  of  addition  facets. 
Construction  and  rotation,  however,  have  been  examined  only 
briefly.  They  are  the  two  "process"  facets  and  thus  merit  closer 
scrutiny. 

The  differential  correlations  between  the  construction 
contrasts,  and  Form  Board  and  Object  Assembly  support  the 
hypothesis  that  mental  construction  is  an  identifiable  spatial 
skill.  Total  time  contrasts  for  this  facet  reveal  that  those  who 
took  much  longer  to  solve  the  two  addition  items  than  would  be 
predicted  by  their  "basal"  time  on  no  construction  items  were 
probably  reverting  to  verbal  strategies.  Even  more  interesting 
was  the  finding  that  those  who  reported  strong  visual  imagery 
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showed  smaller  increases  in  time  between  two  and  three  piece 
additions  than  between  no  addition  and  one  piece  addition.  On 
the  other  hand,  those  whose  times  increased  the  most  between  two 
and  three  piece  additions  may  have  been  using  a  more  analytic 
spatial  strategy. 

Tor  the  rotation  facet,  the  correlation  between  the  linear 
contrast  scores  and  the  two  rotation  reference  tests,  Cards  and 
Figures,  support  the  construct  validity  of  the  facet.  Large 
increases  in  time  over  the  facet  were  associated  with  number  of 
errors  on  the  Cards  test.  Examination  of  component  times  and 
their  correlations  suggests  that  those  who  made  many  errors  on 
the  Cards  test  may  have  been  using  a  verbal  strategy  to  solve 
some  or  all  of  the  items.  Thus,  once  again  time  reveals 
something  about  process  while  correctness  contrasts  address  the 
construct  validity  of  the  facet. 


IV.  Combined  Anal  yses  an  Total  Time  and  Correctness 
Ike  Advantages  aJ.  Combined  Analyses. 

While  separate  univariate  analysis  are  useful  in  providing 
an  overview  of  the  major  results,  they  have  a  number  of 
disadvantages.  First,  it  is  difficult  to  coordinate  the  results 
of  several  analyses  on  correlated  measures.  More  important, 
however,  is  the  fact  that  such  analyses  may  be  misleading.  This 
is  more  a  product  of  the  experimental  dependence  of  time  and 
correctness  than  of  their  statistical  dependence. 

The  interaction  between  rotation  and  product  complexity 
provides  an  example  of  the  ambiguities  that  arise  when  time  and 
correctness  are  analyzed  separately.  The  interaction  was  highly 
significant  for  total  time,  marginally  significant  for  SCO R E 3 
(correctness  adjusted  for  confidence  and  match  time),  and 
nonsignificant  for  SCORE  (correctness).  Plots  of  the  cell  means 
for  each  of  the  three  dependent  variables  are  shown  in  the  top 
row  of  Figure  3.3.  The  plot  for  total  time  reveals  an  almost 
perfect  linear  relationship  between  time  and  rotation  for  low 
complexity  constructed  stimuli.  However,  it  appears  that,  for 
constructed  stimuli  of  high  complexity,  180  degree  rotations  may 


be  performed  as  quickly  as  90  degree  rotations. 

While  the  plots  for  SCORE  do  not  shed  any  light  on  the 
problem,  the  plot  for  SC0RE3  is  suggestive.  The  latter  plot 
shows  a  linear  decrease  in  correctness  for  high  product 
complexity  items.  Note  also  that  probability  correct  (by  SCORE3 ) 
for  high  product  complexity  is  almost  at  .5  for  90  degree 
rotations  and  then  below  it  for  180  degree  rotations.  This 
suggests  that  a  90  degree  rotation  of  a  complex  constructed 
stimulus  was  about  all  the  average  subject  could  manage.  180 
degree  rotations  of  such  stimuli  were  just  too  difficult,  and  so 
subjects  gave  up  and  resorted  to  guessing. 


Sum  and 


Ss.p.re 


While  such  coordinate  examination  of  error  and  latency  data 
is  useful,  there  is  a  better  way  to  examine  their  joint  and 
independent  effects.  Time  and  correctness  (by  SC0RE3)  were  first 
independently  standardized  to  zero  mean  and  unit  variance.  Two 
scores  were  then  derived  by  taking  the  sum  and  the  difference  of 
these  standard  scores.  ZSUM  was  the  sum  of  the  standard  scores 
for  total  time  and  correctness,  and  ZDIFF  time  minus  correctness. 
ZDIFF  contrasts  fast-accurate  (low  negative)  and  slow-inaccurate 
(high  positive)  performance,  while  ZSUM  contrasts  fast-inaccurate 
(low  negative)  and  slow-accurate  (high-positive)  performance. 
Variations  in  facet  difficulty  or  complexity  should  be  reflected 
in  ZDIFF.  i.e..  fast,  accurate  responses  to  easy  items  and  slow, 
inaccurate  responses  to  difficult  items.  ZSUM,  on  the  other  hand 
reflects  variations  in  speed-accuracy  tradeoff. 

Plots  of  ZDIFF  and  ZSUM  for  the  Rotation  by  Product 
Complexity  interaction  are  shown  in  the  bottom  row  of  Figure  3.3. 
Although  the  quadradic  component  of  the  interaction  is  still 
significant  in  this  picture,  the  slopes  of  the  lines  at  the  two 
levels  of  product  complexity  are  more  nearly  parallel.  Further, 
the  ZSUM  plot  reveals  a  large  shift  in  speed-accuracy  tradeoff 
for  high  product  complexity  items.  Subjects  may  have  given  up 
too  soon  or  mayhave  been  unable  to  perform  these  difficult  items. 
Later  results  favor  the  latter  interpretation. 
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Since  these  tuo  derived  scores  are  statistically 
independent,  each  may  be  analyzed  separately  and  the  results  then 
compared  unambiguously. 

Analyses  of  variance  on  ZSUM  and  ZDIFF  are  presented  in 
Tables  3.9  and  3.10.  Estimated  variance  components  are  reported 
elsewhere  (see  Lohman,  1979b).  In  the  ZDIFF  analysis,  design 
facets  accounted  for  52.3  percent  of  the  variation  in  scores,  and 
subjects  and  their  interactions  with  design  facets  accounted  for 
44.0  percent.  Thus,  item  difficulty  was  a  product  of  both  design 
facets  and  subject  interactions.  For  ZSUM,  on  the  other  hand, 
the  design  facets  accounted  for  only  9.4  percent  of  the  variation 
in  the  data.  Subjects  and  their  interactions  with  design  vectors 
accounted  for  84.9  percent.  Thus,  the  speed-accuracy  tradeoff 
was  only  slightly  influenced  by  design  facets;  rather,  it  was 
largely  a  function  of  subjects  and  how  they  responded  to 
particular  item  types.  These  relationships  are  shown  graphically 
in  Figure  3.4. 

Interpretation  a±  the  Sum  and.  Pi  f  f  erence  Score  Anal  yses 
if  go.ntf.ggt . 

This  contrast  reflects  variability  in  the  subject  means  about 
the  grand  mean.  For  ZDIFF,  the  correlations  with  spatial 
reference  tests  were  mostly  negative  but  nonsignificant.  The 
only  significant  correlation  was  -.50  with  WAIS  Picture 
Arrangement.  The  correlations  with  verbal  tests  were  zero  or 
slightly  positive.  Thus,  except  for  the  WAIS  Picture  Arrangement 
subtest,  fast-accurate  performance  on  this  task  was  only  weakly 
related  to  fast-accurate  performance  on  other  spatial  tasks. 

The  N  contrast  for  ZSUM,  on  the  other  hand,  had  a  number  of 
large  positive  correlations  with  reference  tests:  Total  Draw 
(.66),  Paper  Folding  (.55),  Letter  Series  (.55),  Visual  Humber 
Span  (.54),  Terman  Concept  Mastery  (.52),  Memory  for  Designs- 
Quality  (.51),  WAIS  Information  (.47),  and  Verbal  Scale  Score 
(.44).  Although  verbal  tests  are  represented  on  this  list,  the 
higher  correlations  are  associated  with  complex  spatial  and 
reasoning  tests.  The  pattern  is  congruent  with  Cattell's  general 
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Table  3.10 


ZDIFF  Analysts  of  Variance 
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MS 
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df 

MS 
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- 
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- 
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1 

1099.89 
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fluid  ability  factor  (Gf).  Thus,  the  general  tendency  to  opt  for 
accuracy  over  speed  was  associated  with  Gf  or  general  ability 
(  G  )  . 

Construction . 

Construction  was  the  most  powerful  design  facet.  Both  the 
linear  and  quadratic  effects  were  highly  significant  for  ZDIFF. 

As  can  be  seen  in  Figure  3.5.  the  quadratic  effect  reflects  a 
larger  increase  in  difficulty  between  no  construction  (C=l)  and 
one  piece  construction  (C=2).  than  between  one  and  two  piece 
additions.  Subject  interactions  with  the  linear  term  were  also 
highly  significant,  reflecting  large  individual  differences  in 
slopes.  However,  there  were  no  significant  correlations  between 
the  construction  linear  contrast  and  reference  constructs. 
Individual  scores  for  the  quadratic  construction  contrast, 
however,  correlated  .51.  .43,  .54  and  .45  with  Form  Board,  total 

number  of  correct  draw  trials,  WAIS  Object  Assembly,  and  WAIS 
Performance  Scale  Score,  respectively.  Thus  those  subjects  who 
made  relatively  more  errors  and  had  relatively  longer  latencies 
on  level  3  construction  items  than  on  level  2  items  did  better  on 
the  draw  trials  and  on  these  reference  tests.  In  other  words, 
subjects  who  "topped  out"  at  level  2  construction  items  tended  to 
have  lower  scores  on  these  three  variables. 

A  plot  of  the  ZSUM  means  over  the  construction  facet  is  also 
shown  in  Figure  3.5.  Since  subject  interactions  with  both  the 
linear  and  quadratic  components  were  large,  the  plot  is  not 
representative  of  individual  data.  The  most  striking 
relationships  here  were  the  high  positive  correlations  between 
the  linear  contrast  scores  and  verbal  and  memory  span  reference 
constructs,  and  high  negative  correlations  between  quadratic 
contrast  scores  and  spatial  reference  constructs.  Further,  the 
two  contrasts  had  opposite  correlations  with  Total  Draw,  .60  for 
the  linear  contrast  and  -.63  for  the  quadratic  contrast.  The 
quadratic  contrast  correlations  indicate  that  those  who  showed  a 
much  larger  shift  in  speed-accuracy  tradeoff  between  one  and  tw'< 
piece  additions  than  between  zero  and  one  piece  additions 
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Figure  3.5.  Plots  of  main  effect  cell  means  for  total  time,  SC0RF3, 


ZDIFF,  and  ZSUM. 


.wwjnaiia 


performed  poorly  on  the  reference  spatial  tests  and  the  draw 
trials.  On  the  other  hand,  large  shifts  between  no  construction 
and  two  piece  additions  were  associated  with  verbal  and  verbal 
memory  span  tests. 

Eo.ta  Li-og. 

Tor  ZDIFF,  the  linear  contrast  was  highly  significant,  and 
the  quadratic  contrast  only  weakly  significant.  Figure  3.5 
reveals  the  nearly  perfect  linear  relationship  between  ZDIFF  and 
rotation.  Similarly,  subject  interactions  with  the  linear 
component  were  strong,  while  the  subjects  by  rotation  quadratic 
term  was  only  marginally  significant.  There  were  no  significant 
correlations  between  either  contrast  and  the  reference  tests. 

Again  the  ZSUM  contrasts  had  several  noteworthy 
correlations.  The  linear  and  quadratic  contrasts  corelated  .45 
and  -.22  with  number  of  errors  on  the  Cards  tests.  Corresponding 
correlations  with  the  Total  Draw  score  were  .44  and  -.02.  Thus, 
those  who  shifted  to  slower,  more  accurate  responding  for  180 
degree  rotations  did  better  when  they  had  to  draw  the  answer,  but 
made  more  errors  on  the  multiple  choice  Cards  test. 

S-li.muJys  Complex:  ty  . 

For  ZDIFF,  only  the  linear  contrast  and  subjects  by  linear 
contrast  were  significant.  The  straight  line  plot  is  shown  in 
Figure  3.5.  It  is  noteworthy  that  the  analyses  of  total  time 
suggested  significant  nonlinear  trends  for  stimulus  complexity 
and  rotation.  The  plots  became  almost  perfectly  linear  only  when 
latency  and  correctness  were  analyzed  jointly.  The  linear 
contrast  had  only  two  significant  correlations:  .53  with  WAIS 

Verbal  Scale  Score,  and  .67  with  WAIS  Digit  Span  Backwards.  The 
quadradic  component  correlated  .48  with  Visual  Humber  Span  and 
.55  with  WAIS  Digit  Span  Forward.  Together,  these  correlations 
suggest  that  those  who  showed  the  strongest  effects  for  stimulus 
complexity  may  have  been  using  verbal  strategies  to  encode  and 
remember  the  stimuli. 

The  plot  for  ZSUM  on  stimulus  complexity  is  shown  in  Figure 
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3.5.  Again,  only  the  linear  and  subjects  by  linear  contrasts 
were  significant.  As  with  ZDIFF,  correlations  with  reference 
tests  again  suggest  a  connection  with  verbal  ability.  Both 
linear  and  quadratic  contrast  scores  correlated  .45  with  UAIS  IQ. 
Only  the  linear  contrast  had  other  significant  correlations. 

Four  were  with  verbal  tests  (Word  Beginnings  and  Endings,  .50; 
WAIS  Comprehension,  .43,  Vocabulary,  .47,  and  Verbal  Scale  Score, 
.48)  while  two  were  with  spatial  tests  (Cards  Wrong,  .50;  and 
WAIS  Object  Assembly.  .44).  However,  Cards  Wrong  is  probably  a 
better  index  of  verbal  strategies  than  spatial  ability.  Thus, 
large  shifts  toward  slower,  more  accurate  performance  over  the 
rotation  facet  were  most  strongly  related  to  verbal  ability  and 
verbal  strategies. 

PrvfluQt  Complexity. 

As  shown  in  Table  3.10,  product  complexity  and  subjects  by  . 
product  complexity  were  highly  significant  in  ZDIFF  analysis. 

Both  errors  and  latency  were  greater  for  high  complexity 
constructed  stimuli. 

Contrast  scores  for  product  complexity  were  the  best 
predictor  of  performance  on  the  reference  tests.  High 
correlations  with  both  verbal  and  spatial  tests  implicate  general 
ability.  In  particular,  the  performance  of  high  ability  students 
was  influenced  by  the  complexity  of  the  to-be-constructed 
stimulus.  This  is  most  easily  explained  by  assuming  that  high 
ability  students  were  able  to  combine  the  stimuli  mentally  as 
directed,  (using  verbal  strategies  spatial  strategies  or  both) 
while  low  ability  students  were  not  able  to  generate  a  clear 
image  of  the  constructed  stimulus. 

This  result  is  even  more  interesting  in  the  light  of  the 
ZSUM  analysis.  While  the  subjects  by  product  complexity 
interaction  was  significant  for  ZSUM,  the  main  effect  was  not. 
However,  none  of  the  contrast  correlations  were  significant. 
Hence,  the  product  complexity  effect  is  not  a  reflection  of 
systematic  changes  in  speed-accuracy  tradeoff. 
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Be.Len.gg  &±  Sm"?  anj  Difference  Scores 

The  propriety  of  any  index  depends  on  the  question  that  is 
asked.  Thus,  questions  about  the  relationships  between  latency 
or  correctness  and  the  design  facets  or  reference  tests  are  best 
addressed  by  separate  analyses  on  the  two  measures. 

But,  for  example,  reading  and  mathematics  achievement  test 
scores  may  be  usefully  combined  to  represent  two  new  variables, 
namely,  general  scholastic  achievement,  and  differential 
achievement  in  reading  and  mathematics.  Verbal  and  spatial 
scores  also  may  be  summed  to  represent  general  abilty  and 
subtracted  to  provide  a  special  ability  contrast.  Similarly, 
latency  and  correctness  can  be  combined  to  form  new  indices.  The 
major  difference  between  the  separate  scores  and  the  two 
combination  indices  lies  in  their  interpretation.  Further, 
unlike  the  other  examples  cited  above,  latency  and  correctness 
are  not  only  statistically,  but  also  experimentally  dependent. 
This  also  affects  their  interpretation. 

The  difference  score  (time  minus  correctness)  is  best 
interpreted  as  a  composite  index  of  difficulty,  while  the  sura 
represents  both  the  speed-accuracy  tradeoff  and  the  independent 
contributions  of  the  two  variables. 

If  latency  and  correctness  have  opposite  patterns  over  a 
facet  (e.g.»  latency  increases  linearly  while  correctness 
decreases  linearly),  then  the  means  analysis  on  the  difference 
score  will  show  a  larger  effect  for  the  facet  than  either  of  the 
separate  analyses.  Similarly,  the  sum  score  will  be 
nonsignificant  over  the  facet,  reflecting  the  fact  that  the 
patterns  are  the  same,  but  opposite,  for  the  two  variables. 

For  the  correlational  analysis,  if  both  latency  and 
correctness  correlate  positively  with  a  given  test,  then  the  sum 
score  correlation  will  be  higher,  and  the  difference  score 
correlation  zero.  If  latency  correlates  with  verbal  tests  and 
correctness  with  spatial  tests,  then  the  sum  will  correlate  with 
both  and  the  difference  with  neither  cluster  of  tests.  In  such 
cases,  separate  analyses  on  latency  and  correctness  provide 
greater  insight. 
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However,  there  are  times  when  the  sum  and  difference  scores 
provide  unique  information.  Several  puzzling  interactions 
appearing  in  the  separate  analyses  were  eliminated  or  attenuated 
in  the  combined  analyses  (e.g.,  see  Figure  3.3).  Similarly,  both 
rotation  and  stimulus  complexity  showed  nonlinear  trends  in  the 
total  time  analysis,  but  were  linear  in  the  ZDIFF  analysis. 

Thus,  "difficulty”  increased  linearly  over  these  facets,  while 
time  and  correctness  had  nonlinear,  but  opposite,  quadratic 
effects. 

Similarly,  correlations  between  the  sum  and  difference  scores 
contrasts  were  often  quite  unlike  the  contrast  correlations  in 
the  separate  analyses.  For  example,  in  the  analysis  on 
construction  component  time  and  construction  SCORE3  (see  below), 
the  type  of  addition  contrast  for  component  time  correlated 
significantly  with  seven  verbal  tests  (r  3  .41  to  .56)  and  Object 
Assembly  (r  =  .45).  The  SCORE3  type  of  addition  contrast 
correlated  with  Harks  Vividness  of  Visual  Imagery  Questionaire  (r 
3  .49)  and  negatively  with  Visual  Number  Span  (r  =  -.45). 

However,  the  sum  score  correlated  with  Form  Board  (r  3  .43)  and 
Object  Assembly  (r  3  .46),  while  the  difference  score  correlated 
negatively  with  Harks  Questionaire  (r  3  -.56)  and  positively  with 
Visual  Number  Span.  Thus,  the  verbal  tests  dropped  from  the 
picture  and  the  two  tests  most  similar  to  the  experimental 
construction  task  (Form  Board  and  Object  Assembly)  became  the 
only  two  tests  correlated  with  the  sum  score.  The  sum  score 
correlations  occurred  because  Form  Board  and  Object  Assembly  were 
positively  correlated  with  both  the  time  contrast  (r  3  .32  and 
.45)  and  the  SC0RE3  contrast  (r  =  .32  and  .28),  although  the 
three  smaller  correlations  were  not  significant.  On  the  other 
hand,  the  SC0RE3  type  of  addition  contrast  had  zero  or  slightly 
negative  correlations  with  the  verbal  tests.  Thus,  when  SC0RE3 
and  time  were  added  together,  the  two  tests  positively  correlated 
with  both  the  time  and  correctness  contrasts  emerged  with  larger 
correlations,  while  the  verbal  tests  correlations  became 
nonsigni f icant . 
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Summa  r  v  o  f  the  Sum  and  D i f  f e  r enc  e  Score  Analyses 
The  analyses  of  variance  on  ZSUM  and  ZDIFF  differed  most 
significantly  in  the  uay  score  variation  was  accounted  for  by 
facets  and  subject  interactions  uith  facets.  In  particular,  the 
ZDIFF  analysis  revealed  that  item  difficulty  (on  a  scale  of  fast- 
accurate  to  s 1 ou- inaccu r a t e )  was  a  function  of  both  facets  and 
subject  interactions  uith  facets.  The  speed-accuracy  tradeoff 
represented  by  ZSUM,  on  the  other  hand,  was  only  slightly 
influenced  by  design  facets.  Rather,  variation  in  these  scores 
was  largely  a  function  of  subject  interactions  uith  design 
facets  . 

Most  of  the  relationships  between  ZSUM  and  ZDIFF  contrast 
scores  and  reference  tests  are  congruent  uith  the  previous 
analyses  on  total  time  and  SC0RE3.  There  are  important 
differences,  however,  and  these  are  noted  below. 


tt.  Contrast . 

For  ZDIFF,  generally  fast  and  accurate  performance  on  this 
task  was  weakly  related  to  similar  performance  on  other  spatial 
tasks,  and  strongly  indicative  of  high  scores  on  the  UAIS  Picture 
Arrangement  subtest.  There  was  no  relationship,  or,  if  anything, 
an  inverse  relationship  between  ZDIFF  and  performance  on  Verbal 
tests. 

For  ZSUM,  the  general  tendency  to  opt  for  accuracy  over  speed 
was  associated  uith  Gf. 


Construction. 

As  in  the  SC0RE3  analysis,  nonlinear  increases  in  ZDIFF  were 
positively  related  to  Form  Board  and  Object  Assembly.  Those  who 
did  not  take  more  time  and  make  more  errors  on  two  piece 
additions  than  on  one  piece  additions  performed  poorly  on  two 
reference  construction  tests. 

One  explanation  for  this  phenomenon  would  be  that  those  who 
performed  poorly  on  these  construction  tasks  did  not  fully 
appreciate  the  increased  difficulty  of  the  two  piece  additions, 
and  thus  did  not  really  try  harder  on  the  more  difficult 


additions.  However,  the  ZSUM  contrast  correlations  argue  against 
this  hypothesis.  Large  shifts  in  speed-accuracy  between  levels 
two  and  three  on  the  construction  facet  were  also  related  to  poor 
performance  on  the  reference  tests.  Therefore,  a  better 
explanation  of  the  ZDIFF  quadratic  correlations  is  that  those  who 
did  not  show  an  increase  in  error  rate  and  time  when  shifting 
from  one  to  two  piece  additions  had  reached  their  limit  on  the 
one  piece  additions. 

Finally,  the  ZSUM  linear  correlations  related  large  shifts  in 
s peed -accu r ac y  trade  off  between  no  construction  and  two 
additions  to  verbal  ability. 

RalalLan • 

Only  the  ZSUM  linear  contrast  had  significant  correlations 
for  the  rotation  facet.  Large  shifts  toward  slower,  more 
accurate  performance  were  related  to  number  of  errors  on  the 
Cards  test  and  Visual  Humber  Span. 

SJLinm.Ln.a.  complexity. 

The  ZSUM  and  ZDIFF  analyses  indicated  that  those  who  showed 
the  largest  effects  for  this  facet  may  have  been  using  verbal 
strategies  i<*  encode  and  remember  the  stimuli.  The  major 
difference  between  these  correlations  and  those  for  the  same 
contrast  in  the  total  time  analysis  was  the  focus  on  verbal  sho: t 
term  memory  tests  in  the  ZDIFF  analysis. 

Emln-gl  c.sunal.«sLt.y  • 

The  most  significant  contribution  of  the  ZSUM.  ZDIFF  analyses 
here  was  the  finding  that  the  product  complexity  effect  was  not  a 
function  of  systematic  changes  in  speed-accuracy  trade-off. 

1-  Sam&gntnt  Hat  AjuJLxutt 

At  least  two  component  times  (encode,  match)  and  as  many  as 
four  component  times  (encode,  add.  rotate,  match)  were  collected 
for  each  item.  Encoding  and  match  times  were  available  for  all 
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216  items,  construction  and  rotation  times  for  144  items.  Since 
construction  aluays  preceded  rotation,  rotation  component  times 
for  the  96  items  that  also  required  construction  uere  not 
experimentally  independent  of  the  construction  operations. 
Rotation  time  on  these  items  depended  not  only  on  the  success  of 
the  rotation,  but  also  on  the  success  of  the  preceding  addition. 
Similarly,  only  24  match  times  uere  not  preceded  by  construction 
or  rotation.  Therefore,  the  analyses  of  the  rotation  and  match 
component  times  uere  necessarily  more  limited  than  those  on 
encoding  and  construction. 


Encod i no 

Means  Analysis- 

Stimulus  complexity  and  practice  (or  block)  uere  the  only 
tuo  factors  represented  in  the  design  likely  to  affect  encoding. 
The  analysis  of  variance  of  encoding  time  by  these  factors  is 
reported  in  Table  3.11. 

Only  the  linear  components  of  both  stimulus  complexity  and 
block  uere  significant.  Cell  means  uere  4.02,  5.41,  and  6.92  sec 
for  lou,  medium,  and  high  complexity  stimuli,  respectively.  Cell 
means  for  the  four  blocks  uere  6.71,  5.33,  and  5.25.  and  4.49 
sec . 

There  uere  significant  individual  differences  in  both  linear 
contrasts  and  in  the  nonlinear  block  contrast.  Subjects  also 
interacted  significantly  uith  the  linear  by  linear  contrast. 

C<?n.Uft5,t  Correlations 

&  Contrast .  The  first  important  finding  is  a  negative  one: 
there  uere  no  relationships  betueen  average  encoding  time  and  any 
of  the  reference  tests  or  total  scores.  The  correlation  betueen 
mean  encoding  time  and  IQ  uas  -.03.  The  only  significant 
correlation  (r*.48)  uas  uith  number  of  marks  made  on  the  Form 
Board  test,  a  strategy  index.  Those  who  made  more  marks  on  the 
stinuli  in  the  Form  Board  test  spent  more  time  studying  the 
stimuli  in  this  experiment. 

Stimulus  C omo laxity  The  stimulus  complexity  linear 
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Table  3.11 

Analysis  of  Variance  for  Encoding  Component  Time 


Source 


df  MS 


Source 


Mean  1  10683.44 

Stimulus  Complexity  (2) 


Linear 

Quadratic 

Block 

Linear 

Nonlinear 

SB 

Lin  x  Lin 
Qd  x  Nln 


1  503.07 

1  .32 

(3) 

1  203.11 

2  26.46 

(6) 

1  2.81 

5  .82 


176.08 

69.90* 

.48 

18.33* 

2.05 


Linear 


Linear  29 
Nonlinear  58 
SB  (174) 

Lin  x  Lin  29 
Qd  x  Nln  145 


11  .SO* 

1.11 

18.16* 

10.57* 

2.50* 


*MS(e)  estimated  by  the  NS  (.quadrat  ic)B  (nonlinear)  interaction. 
*p  less  than  .01 


contrast  had  significant  correlations  with  UAIS  Digit  Span 
(r=.47),  Vocabulary  (r=.44),  Verbal  Scale  Score  (.49),  and  thus 
IQ  (.43).  The  correlation  uith  UAIS  Performance  Scale  Score  was 
a  nonsignificant  .27.  Only  one  spatial  test.  Paper  Folding, 
correlated  significantly  uith  the  contrast  (r=.47).  There  was 
also  an  interesting  differential  set  of  correlations  uith  the  two 
correctness  indices:  r=.43  uith  total  SCORE,  but  only  .14  uith 
total  SC0RE3. 

The  quadradic  component  had  even  higher  correlations  uith 
verbal  tests.  Correlations  uith  all  UAIS  verbal  subtests  except 
Arithmetic  were  significant,  and  the  correlation  uith  Verbal 
Scale  Score  uas  .58.  Other  significant  correlations  were  total 
correct  Draw  trials  (.52),  Visual  Number  Span  (.44),  Number 
Comparison  (.43),  and  Camouflaged  Uords  (.54). 

These  quadratic  component  correlations  mean  that  those  uho 
shoued  greater  increases  in  encoding  time  between  medium  and  high 
complexity  than  betueen  low  and  medium  complexity  tended  to  score 
higher  on  verbal  tests  than  those  whose  times  increased  linearly 
or  by  the  reverse  pattern. 

Strategy  reports  provide  a  good  explanation  of  this 
phenomenon.  Some  subjects  reported  that  they  tried  to  remember 
complex  stimuli  by  seeing  them  as  composites  of  simpler  forms 
such  as  triangles  and  squares.  Both  additive  and  subtractive 
versions  of  this  strategy  were  reported  ,  i.e.,  "a  triangle  on 
top  of  a  square, "  and  "a  square  uith  a  triangle  cut  out  of  the 
top .  " 

Subjects  using  this  strategy  would  be  much  faster  on  low  and 
medium  complexity  stimuli  (3-4  points)  since  most  of  these 
stimuli  can  be  immediately  labelled,  even  when  irregular  (e.g.  ”a 
triangle  leaning  to  the  right”).  Host  high  complexity  stimuli 
were  not  so  familiar,  and  so  the  process  of  decomposing  then  into 
simpler  shapes  or  looking  for  some  concrete  association  uould 
have  taken  longer. 


Hi*  Cain  leus  li-gn  iLgriE.fln.tal 

Of  the  total  2)6  items.  had  a  construction  component 
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Construction  component  times  were  relatively  "clean"  since 
rotation,  if  required,  always  followed  construction.  Data  were 
collapsed  over  the  rotation  facet,  and  mean  time  and  number 
correct  within  each  of  these  24  cells  were  then  used  in  the 
analyses.  Note  that  the  construction  facet  has  only  two  levels 
her  e . 

Separate  analyses  of  variance  were  performed  on  time  and 
correctness  (by  SC0RE3)  and  are  reported  in  Lohman  (1979b).  The 
major  differences  between  the  two  analyses  were: 

1.  Construction  (K)  was  a  highly  significant  facet  in 
the  time  analysis  (F=118.79),  but  not  significant 
in  the  SC0RE3  analysis  (F*.44>.  Thus,  subjects 
took  much  longer  to  add  two  stimuli  to  a  given  base 
than  to  add  one.  However,  they  did  not  make 
relatively  more  errors  on  the  two  piece  additions. 

2.  The  KP,  ST,  and  KSTP  interactions  were  highly 
significant  in  the  time  analysis  but  not 
significant  in  the  SC0RE3  analysis. 

3.  In  the  KS  and  SP  interactions,  the  linear  component 
of  the  contrast  was  highly  significant  in  one 
analysis  and  the  quadratic  component  significant  in 
the  other  analysis. 

As  in  previous  analyses,  time  and  correctness  were  analyzed 
jointly  by  computing  the  sum  and  difference  of  the  independently 
normalized  scores.  These  variables  were  called  CSUM  and  CDIFF. 
Analyses  of  variance  on  each  are  reported  in  Tables  3.12  and 
3.13.  Cell  means  are  reported  elsewhere  (see  Lohman,  1979b). 

In  the  CDIFF  analysis,  all  main  effects  and  most  facet 
interactions  were  significant,  while  only  the  NK.  NP,  NKP,  and 
NKTP  s ub j ec t- f ac e t  interactions  were  significant.  On  the  other 
hand,  construction  (K)  and  product  complexity  were  the  only 
significant  main  effects  in  the  CSUH  analysis.  Most  facet 
interactions  were  significant,  but  generally  smaller  than  their 
counterparts  in  the  CDIFF  analysis.  Large  s u b : ec t - f a c e t 
interactions  uere  obtained  for  K,  S  linear,  and  P. 
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Table  3.12 

CSUM  Analysis  of  Variance 


Source 

df 

MS 

F 

Source 

df 

MS 

F 

Mean 

1 

.000 

- 

N 

29 

19.674 

- 

Construction  (K) 

1 

122.776 

81.69* 

NK 

29 

1.503 

2.90* 

Stimulus  Complexity 

(2) 

MS 

(58) 

Linear 

1 

.994 

.78 

Linear 

29 

1.278 

2.46* 

Quadratic 

1 

.388 

2.04 

Quadratic 

29 

.435 

.84 

Type  of  Addition 

1 

.376 

.47 

NT 

29 

.801 

1.54 

Product  Complexity 

1 

.248 

.12 

NP 

29 

2.023 

3.90* 

KS 

(2) 

NKS 

(58) 

Linear 

1 

7.057 

15.86* 

Linear 

29 

.445 

.86 

Quadratic 

1 

.000 

.00 

Quadratic 

29 

.669 

1.29 

KT 

1 

.143 

.15 

NKT 

29 

.938 

1.81* 

KP 

1 

16.119 

17.43* 

NKP 

29 

.925 

1.78 

ST 

(2) 

i 

NST 

(58) 

Linear 

1 

.571 

1.05 

Linear 

29 

.545 

1.05 

Quadratic 

1 

.721 

.93 

(fciadratic 

29 

.773 

1.49 

SP 

(2) 

NSP 

(58) 

Linear 

1 

13.913 

16.99* 

Linear 

29 

.819 

1.58 

Quadratic 

1 

4.152 

6.19 

Quadratic 

29 

.671 

1.29 

TP 

1 

1.360 

2.99 

NTP 

29 

.622 

1.20 

KST 

(2) 

NKST 

(58) 

Linear 

1 

.404 

l.oi  i 

Linear 

29 

.400 

.77 

Qiadratic 

1 

4.728 

11.07* 

Quadratic 

29 

.427 

.82 

KSP 

(2) 

NKSP 

(58) 

Linear 

1 

3.347 

5.86 

Linear 

29 

.571 

1.10 

Qiadratic 

1 

2.902 

3.50 

Quadratic 

29 

.328 

1.60 

KIP 

1 

7.864 

7.49 

NKTP 

29 

1.050 

2.02* 

STP 

(2) 

NSTP 

(58) 

Linear 

1 

./58 

1.19 

Linear 

29 

.639 

1.23 

Quadratic 

1 

3.675 

4.65 

Quadratic 

29 

.790 

1.52 

KSTP 

(2) 

NKSTP 

(58) 

Linear 

1 

4.586 

9.38* 

Linear 

29 

.489 

- 

(Vadratlc 

1 

1.236 

2.25 

ad  ratio 

29 

.549 

- 

Ns(e)  estimated  by  NKSTP  interaction. 
*p  !•••  than  .01 
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Table  3.13 

CDIFF  Analysis  of  Variance 


Source 

df 

MS 

F 

Source 

df 

MS 

Fa 

Mean 

1 

.000 

- 

N 

29 

11.188 

- 

Construction  (K) 

1 

140.747 

97.81* 

NC 

29 

1.439 

2.44* 

Stimulus  Complexity 

(2) 

NS 

(58) 

Linear 

l 

70.411 

96.98* 

Linear 

29 

.726 

1.23 

Quadratic 

1 

5.599 

11.13*  ! 

Quadratic 

29 

.503 

.85 

Type  of  Addition 

1 

42.912 

53.91* 

NT 

29 

.796 

1.35 

Product  Complexity 

1 

253.171 

109.17* 

NP 

29 

2.319 

3.94* 

KS 

(2) 

NKS 

(58) 

Linear 

1 

8.619 

15.23* 

Linear 

29 

.566 

.96 

Quadratic 

1 

6.459 

16.69* 

Quadratic 

29 

.387 

.66 

K3 

1 

33.391 

51.45* 

NKT 

29 

.649 

1.10 

KP 

1 

8.619 

8.16* 

NKP 

29 

1.056 

1.79 

ST 

(2) 

NST 

(58) 

Linear 

1 

17.511 

26.57* 

Linear 

29 

.659 

1.12 

Quadratic 

l 

.192 

.29 

Quadratic 

29 

.666 

1.13 

SP 

(2) 

NSP 

(58) 

Linear 

1 

16.251 

23.52* 

Linear 

29 

.691 

1.17 

Quadratic 

1 

4.401 

5.65 

Quadratic 

29 

.779 

1.32 

TP 

1 

28.227 

38.61* 

NTP 

29 

.731 

1.24 

KST 

(2) 

NKST 

(58) 

Linear 

1 

13.187 

24.11* 

Linear 

29 

.547 

.93 

t^iadratic 

1 

.499 

.82 

Quadratic 

29 

.612 

1.04 

KSP 

(2) 

NKSP 

(58) 

Linear 

1 

7.203 

13.72* 

Linear 

29 

.525 

.89 

Quadratic 

1 

25.007 

32.23* 

Quadratic 

29 

.776 

1.32 

KTP 

l 

25.560 

21.48* 

SKIP 

29 

1.190 

2.02 

STP 

(2) 

NSTP 

(58) 

Linear 

1 

1.532 

2.02 

Linear 

29 

.760 

1.29 

(^2 ad  rat  lc 

1 

5.217 

7.46 

Quadratic 

29 

.699 

1.19 

KSTP 

(2) 

NXSTP 

1 

(58) 

Linear 

1 

.001 

.00 

Linear 

29 

.672 

- 

Quadratic 

l 

7.530 

14.88* 

Quadratic 

29 

.506 

*MS(e)  e  Jt  Lmated  by  NKSTP  interaction. 


*p  lea*  than  .01 
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Construction  Facet ■ 


As  previously  noted,  K  uas  a  powerful  facet  in  the  component 
time  analysis,  but  not  significant  in  the  SC0RE3  analysis  of 
variance.  This  emerged  as  a  large  main  effect  in  both  the  CSUM 
and  CDIFF  analyses.  The  latter  reflects  the  fact  that  two  piece 
additions  were  more  difficult  than  one  piece  additions.  The  CSUM 
main  effect  reflects  the  large  changes  in  s peed - ac c u r a c y  tradeoff 
over  the  facet.  Subjects  maintained  a  constant  error  rate  by 
taking  an  average  of  4.56  sec  longer  to  solve  the  two  piece 
additions . 

Individual  contrast  scores  on  CDIFF  correlated  significantly 
with  Cards  Wrong  (.48),  total  correct  Draw  trials  (.49), 

Necessary  Arithmetic  Operations  (.43),  Terman  Concept  Mastery 
(.55),  WAIS  Information  (48),  Vocabulary  (46),  and  Verbal  Scale 
Score  (.49).  Thus  high  verbals  were  most  drastically  affected  by 
the  shift  from  one  to  two  piece  additions. 

The  CSUM  contrast  scores  had  only  one  significant 
correlation:  .56  with  Visual  Number  Span  Longest  Correct. 

Perhaps  those  good  at  remembering  numbers  tend  to  be  poor  at 
remembering  figures. 

Stimulus  Complexity. 

Both  linear  and  quadratic  stimulus  complexity  contrasts  were 
significant  in  the  CDIFF  analysis.  The  latter  reflects  a  greater 
increase  in  difficulty  between  medium  and  high  than  between  low 
and  medium  complexity.  Neither  contrast  was  significant  in  the 
CSUM  analysis. 

Individual  CDIFF  linear  contrast  scores  correlated 
significantly  with  a  cluster  of  tests  similar  to  those  found  on 
factors  called  Reasoning  in  the  factor  analytic  literature.  The 
tests  and  their  correlations  were:  Necessary  Arithmetic 
Operations  (.43),  WAIS  Arithmetic  (.50),  Digit  Span  Backwards 
(.47),  and  Verbal  Scale  Score  (.45).  On  the  other  hand,  the 
CDIFF  quadratic  contrast  scores  correlated  most  strongly  with 
spatial  reference  tests.  Seven  of  ten  significant  correlations 
were  with  spatial  tests.  Thus,  for  subjects  with  high  spatial 
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scores,  additions  involving  medium  complexity  stimuli  were  only 
slightly  more  difficult  than  those  involving  lou  complexity 
stimuli.  Additions  of  high  complexity  stimuli  uere  much  more 
difficult. 

For  CSUM,  the  linear  contrast  scores  correlated  with  Word 
Beginnings  and  Endings  (.55),  Uses  for  Things  (.51),  and  UAIS 
Vocabulary  (.44).  Once  again,  large  changes  in  speed-accuracy 
tradeoff  over  the  stimulus  complexity  factor  are  associated  with 
verbal  ability,  this  time  with  a  cluster  more  properly  called 
Word  Fluency  or  Verbal  Productive  Thinking. 

Iz&s  si  Mii-iioji. 

While  the  main  effect  indicates  that  adding  to  the  left  uas 
generally  more  difficult  than  adding  to  the  right,  the  KT 
interaction  reveals  that  this  difference  held  only  for  tuo  piece 


add i t i ons 


Contrast  scores  on  CDIFF  for  this  factor  correlated 


-.56  with  Marks  Vividness  of  Visual  Imagery  Questionaire  and  .56 
with  Visual  Number  Span.  Thus,  those  who  had  the  most  difficulty 
adding  to  the  left  reported  poor  visual  imagery  but  had  good 
short  term  recall  for  numbers. 

P C.Vdlig t  Complexity. 

Once  again  product  complexity  emerged  as  a  powerful  factor, 
both  in  the  CDIFF  analysis  of  variance  and  the  CDIFF  contrast 
correlations.  Additions  that  yielded  high  complexity  products 
were  much  more  difficult  than  those  that  combined  to  form  low 
complexity  products.  This  held  for  both  time,  correctness,  and 
the  CDIFF  score. 

CDIFF  contrast  scores  correlated  with  complex  spatial  tests 
(Paper  Folding,  Surface  Development),  Gf  or  G  tests  (Hidden 
Figures,  Necessary  Arithmetic  Operations,  Leter  Series,  Terman 
Concept  Mastery,  WAIS  Block  Design),  and  the  WAIS  Verbal  tests. 
Thus,  the  effect  once  again  appears  to  be  associated  with  general 
ability.  High  ability  students  took  more  time  and  made  more 
errors  on  high  than  on  low  product  complexity  additions. 

The  only  significant  CSUM  correlation  was  with  an  "internal" 
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r^T' 


variable:  Total  Correct  Drau  (r».44).  This  reflects  the  obvious 

conclusion  that  those  uho  were  more  careful  on  high  complexity 
additions  uere  better  able  to  drau  them  uhen  requested  to  do  so. 

Ih.9.  M  t  fcal  SLtll  Totals  ■ 

The  CDIFT  N  contrast  reflects  subjects’  relative  positions  on 
a  f as t -accu r a t e  to  s 1 ou- i naccu r a t e  scale.  The  only  significant 
reference  test  correlations  for  this  contrast  uere  -.44  uith 
Figures  and  -.58  uith  UAIS  Picture  Arrangement.  Thus,  fast- 
accurate  construction  uas  associated  uith  high  scores  on  these 
tuo  tests. 

The  CSUH  H  contrast  scores,  on  the  other  hand,  correlated 
uith  a  number  of  reference  tests.  The  tests  and  their 
correlations  uith  CSUH  are  shoun  in  the  first  column  of  Table 
3.14.  High  scores  on  these  tests  are  associated  uith  slou- 
accurate  responding  in  this  task.  Given  the  preponderance  of 
verbal  tests  in  the  list,  it  uould  appear  that  high  CSUH  scores 
uere  more  associated  uith  verbal  ability  (or  Gc)  than  spatial 
ability  (or  Gf). 

The  N  contrast  scores  for  the  independent  analyses  on  the 
construction  component  time  and  SC0RE3  uere  even  more  revealing. 
These  contrasts  represent  average  construction  component  time  and 
average  SC0RE3  for  each  subject.  As  shoun  in  Table  3.14,  average 
construction  time  correlated  uith  verbal  tests,  uhile  average 
SCORE 3  correlated  uith  spatial  tests. 

Mean  time  and  SC0RE3  uithin  each  of  the  24  cells  of  the 
construction  component  design  uere  also  correlated  uith  the 
reference  tests.  Six  of  these  component  times  correlated  uith 
MAIS  Verbal  Scale  Score.  All  uere  for  the  more  difficult  cells, 
i.e.,  high  stimulus  complexity  (four  of  six),  high  product 
complexity  (all  six),  tuo  piece  additions  tfour  of  six),  and  left 
additions  (four  of  six).  Cell  means  for  the  easier  items  had 
small  positive  correlations  uith  verbal  tests  and  zero  or  slight 
negative  correlations  uith  spatial  tests.  However,  only  tuo 
Closure  Speed  test  correlations  uere  significant  (see  below). 

For  the  SCORE 3  cell  totals,  only  four  of  the  24  cells 
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Table  3.14 

Correlations  between  Construction  CSUM,  Component  Time 
and  SC0RE3  N  Contrasts  and  Reference  Tests  (N«30) 


Reference  Test 

CSUM 

Time 

SC0RE3 

Paper  Folding 

56 

67 

Form  Board 

55 

Surface  Development 

48 

63 

Memory  for  Designs  -  Quality 

58 

72 

Total  Correct  Draw  Trials 

76 

51 

73 

Copying 

44 

Cards 

47 

Figures 

55 

Identical  Pictures 

41 

45 

Visual  Number  Span 

59 

52 

41 

Word  Transformations 

42 

Letter  Series 

50 

45 

Terman  Concept  Mastery 

58 

54 

UAIS  Information 

52 

47 

Digit  Span  Forward 

44 

43 

Vocabulary 

48 

47 

Picture  Arrangement 

49 

Object  Assembly 

42 

Verbal  Scale  Score 

52 

45 

Performance  Scale  Score 

57 

Full  Scale  Score 

52 

52 

Note.  Decimals  and  correlations  less  than  .41  omitted 


correlated  significantly  with  I4AIS  Verbal  Scale  Score.  However, 
these  same  cells  also  had  high  correlations  with  spatial  tests, 
thus  the  relationship  is  with  general  ability.  On  the  other 
hand.  23  cells  correlated  significantly  with  spatial  tests.  The 
remaining  three  cells  did  not  correlate  with  anything  in  the 
battery. 

Therefore,  component  latency  and  correctness  were  largely 
independent  aspects  of  performance.  Long  solution  times  on 
difficult  additions  were  associated  with  high  verbal  ability. 
Latency  on  easy  additions  was  unrelated  to  either  spatial  or 
verbal  ability.  Correctness  on  both  easy  and  difficult  items  was 
associated  with  spatial  ability. 

gJL«  im.fi  S.Btftd  ■ 

Twenty-one  of  the  24  correlations  between  construction 
component  cell  mean  times  and  the  Closure  Speed  reference  factor 
were  negative.  However,  only  two  were  significant.  The  only 
design  facet  shared  by  the  ten  highest  correlations  (r=-.30  to 
-.60)  was  low  product  complexity.  None  of  the  SCORE3  cell  total 
correlations  with  the  Closure  Speed  factor  were  significant  (in 
fact,  only  two  were  above  .17).  Thus,  as  argued  elsewhere 
(Lohnan.  1977b;  1979),  Closure  Speed  is  not  a  separate  cognitive 

skill,  but  merely  one  aspect  of  spatial  ability.  Further,  these 
results  support  the  argument  that  the  factor  would  be  better 
represented  by  latency  than  the  usual  index  of  total  correct 
within  a  given  time  limit. 

the  Rotation  £ame.g.ag.n  1 

Of  the  216  items.  144  required  rotation  and  thus  had  a 
rotation  component  time.  Since  construction  always  preceded 
rotation,  rotation  component  times  for  the  96  items  that  also 
required  construction  were  not  experimentally  independent  of  the 
construction  operations.  Rotation  time  on  these  items  depended 
not  only  on  the  success  of  the  rotation,  but  also  on  the  success 
of  the  preceding  addition. 

Separate  analyses  were  performed  on  the  48  items  where 
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rotation  was  not  preceded  by  construction  and  on  the  96  items 
requiring  both  construction  and  rotation.  Details  of  the  latter 
analysis  are  presented  in  Lohman  (1979b).  and  Mill  be  reviewed 
only  briefly  here. 


Analyses  of  variance  on  rotation  time  and  SCO  RE  3  tor  the  48 
no  construction  items  are  shown  in  Tables  3. IS  and  3.16. 

Separate  analyses  were  also  performed  on  the  sum  and  difference 
of  normalized  rotation  time  and  correctness  scores.  However, 
since  there  were  no  discrepancies  between  the  rotation  time  and 
SC0RC3  analyses,  the  sum  and  difference  scores  provided  no  unique 
information.  The  difference  score  analysis  coincided  with  those 
reported  in  Tables  3.15  and  3.16.  while  neither  facets  nor 
subject-facet  interactions  were  significant  in  the  sum  score 
analysis. 

Both  rotation  time  and  SC0RE3  showed  large  main  effects  for 
rotation,  no  main  effect  for  stimulus  complexity,  and  a  small, 
but  significant  rotation  by  stimulus  complexity  interaction. 

There  were  also  large  practice  effects.  Both  rotation  time  and 
errors  decreased  linearly  over  the  four  blocks.  Figure  3.6a 
shows  a  substantial  decrease  in  rotation  time  over  blocks,  but 
only  a  slight  increase  in  slopes.  As  is  shown  in  Figure  3.6b. 
the  decrease  in  errors  was  much  larger  for  180  degree  rotations 
than  for  90  degree  rotations. 

Although  the  rotation  by  stimulus  complexity  interaction  wrs 


significant  in  both  analyses,  it  is  probably  not  reliable.  The 
interaction  for  rotation  time  is  plotted  separately  for  each 
block  in  Figure  3.7.  The  interaction  was  different  in  each 
block,  and  significant  only  in  the  first  two  blocks. 


If  nothing  else.  Figure  3.7  reveals  that  what  was  happening 
in  Block  1  is  not  the  same  as  what  was  happening  in  Blocks  3  and 
4.  This  suggests  that  practice  alters  the  relationships  between 


rotation  times  and  scores,  and  external  variables. 


Table  3.13 

Analysis  of  Variance  for  Rotation  Component  Time 


Source 

df 

MS 

F 

Source 

df 

MS 

F* 

Mean 

1  3162.33 

195.34* 

N 

29 

16.18 

Rotation 

1 

43.27 

35.41* 

NR 

29 

1.22 

1.54 

Stimulus  Complexity 

(2) 

NS 

(58) 

Linear 

1 

.05 

.07 

Linear 

29 

.83 

1.05 

Quadratic 

l 

1 .44 

3.43 

Quadratic 

29 

.42 

.53 

RS 

(2) 

NRS 

(58) 

Linear 

1 

1.99 

2.11 

Linear 

29 

.94 

- 

Quadrat lc 

l 

7.81 

12.24* 

Quadrat ic 

29 

.63 

- 

^SCe)  estimated  by 

MRS 

Interaction. 

*p  less  than  .01 
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Table  3.16 

Analysis  of  Variance  for  Rotation  SC0RE3 


Source 

df 

MS 

F  •  ‘ 

Source 

df 

MS 

F  * 

Mean 

1 

1 10.646 

1121.19* 

N 

29 

.099 

- 

Rotation 

1 

.176 

11.15* 

NR 

29 

.016 

.81 

Stimulus  Complexity 

(2) 

NS 

(58) 

Linear 

1 

.038 

2.38 

Linear 

29 

.016 

.89 

Quadratic 

1 

.002 

.11 

Quadratic 

29 

.019 

.81 

RS 

(2) 

NRS 

(58) 

Linear 

1 

.022 

1.01 

Linear 

29 

.022 

- 

Quadratic 

1 

.207 

11.96* 

Quadratic 

29 

.017 

“SlSCe)  estimated  by  the  NRS  interaction. 
*p  less  than  .01 
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Correlations  between  mean  SCORE  and  rotation  time  within 
each  block,  and  selected  reference  variables  are  shown  in  Table 
3.17.  SCORE  had  its  maximum  correlations  in  Blocks  3  and  4, 
while  rotation  time  correlations  were  highest  in  Blocks  1  and  2. 
While  SCORE  land  SC0RE3)  had  many  significant  correlations  with 
reference  tests,  mean  rotation  time  correlated  significantly  only 
with  number  of  errors  on  the  Cards  test. 

When  collapsed  over  blocks,  number  correct  180  degree 
rotations  correlated  slightly  higher  with  reference  tests  than 
the  number  of  correct  90  degree  rotations.  For  example, 
correlations  with  Paper  Folding  were  .46  and  .53,  and  with 
Surface  Development  .44  and  .54,  for  90  and  180  degree  rotations, 
respectively. 

Once  again,  steep  slopes  were  related  to  verbal  ability. 

The  rotation  time  contrast  scores  (180  time  minus  90  time) 
correlated  with  UA1S  Comprehension  (.44),  Digit  Span  (.50)  and 
Verbal  Scale  Score  (.46).  For  SC0RE3,  scores  on  the  rotation 
contrast  were  positively  correlated  with  two  visual  memory  tests: 
Film  Memory  111  (r*.53)  and  Visual  Humber  Span  (r=.51).  Since 
Visual  Humber  Span  usually  falls  with  the  verbal  tests  or  verbal 
strategy  indices,  and  has  frequently  opposed  the  Marks  Vividness 
of  Visual  Imagery  dues t i ona i r e ,  the  interpretation  of  these 
correlations  is  uncertain.  Those  who  did  not  show  a  large 
decline  in  correctness  between  90  and  180  degree  rotations  may 
have  had  good  visual  memory,  and  relied  on  it  to  rotate  the 
entire  stimulus,  ignoring  other  cues  that  would  make  the  task 
easier  but  more  error  prone. 

The  linear  stimulus  complexity  scores  for  the  rotation  time 
contrast  correlated  .55  with  Digit  Span.  Similarly,  for  SCORE3, 
the  linear  stimulus  complexity  contrast  correlated  -.52  with 
Digit  Span  and  -.43  with  Uses  for  Things.  Thus,  those  uho  took 
longer  and  missed  more  high  stimulus  complexity  rotations  than 
low  complexity  rotations  were  probably  attempting  to  encode  and 
remember  the  stimulus  or  its  distinctive  features  verbally. 
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Table  3.17 

Correlations  between  Mean  SCORE,  Mean  Rotation  Component 
Time  and  Reference  Tests,  by  Block.  (N*30) 


SCORE 


Reference  Tesc 

Block  1 

Block  2 

Block  3 

Block  ** 

VAIS  IQ 

10 

21 

44* 

26 

Paper  Folding 

2 

34 

64* 

44* 

Surface  Development 

31 

32 

o  1* 

35 

Cards 

02 

26 

39 

•♦4  * 

F igures 

02 

IS 

22 

45* 

Total  Correct  Draw  Trials 

23 

43* 

62* 

41 

ROTATION 

TIME 

Reference  Test 

Block  1 

Block  2 

Block  3 

Block  4 

WAIS  IQ 

23 

25 

-13 

-03 

Cards  Wrong 

43* 

55* 

24 

18 

Identical  Pictures 

31 

30 

17 

12 

Total  Correct  Draw  Trials 

42* 

44* 

17 

19 

Note .  Decimals  omitted,  r  **  .42  significant  at  .01 


Rotation  o f  Corn  t  r voted  Stimuli  . 

The  major  finding  here  uas  that  it  took  longer  to  rotate  a 
constructed  stimulus.  Figure  3.8  shous  the  plot  of  rotation  time 
for  each  level  of  the  construction  facet.  Product  complexity  did 
not  affect  rotation  time,  but  did.  of  course,  have  large  effects 
on  error  rate.  Stimulus  complexity  affected  both  error  rate  and 
rotation  time,  especially  for  constructions  involving  high 
complexity  stimuli.  Houever.  the  previously  noted  limitations  of 
these  data  make  interpretations  risky. 


Match  Component  Times 

Since  accepting  or  rejecting  the  test  stimulus  uas  the  last 
component  operation  in  every  item,  match  component  times  depend 
on  the  success  of  all  previous  operations.  As  such,  they  are  the 
most  difficult  component  times  to  interpret. 

Separate  analyses  were  performed  on  the  2k  match  lines 
preceded  only  by  encoding  and  all  216  match  times  Although 
analyses  of  variance  uere  performed  on  natch  times  and  on  the  sum 
and  difference  oi  separately  normalized  match  times  and  a 
correctness  index,  only  descriptive  statistics  and  correlational 
results  are  summarized  here.  Details  of  the  analyses  of  variance 
are  reported  in  Lohman  (1979b). 


All  Match  Times .  Subjects  uere  first  divided  into  three  groups  on 
the  basis  of  total  SCORE’S  over  all  216  items.  Mean  match  time 
uas  then  computed  for  each  design  facet  separately  for  correct 
and  incorrect  responses  for  each  group. 

The  major  findings  uere: 

1.  Match  times  uere  shorter  for  correct  responses  than 
incorrect  responses.  Mean  times  uere  1.43  and  1.76 
sec  for  correct  and  incorrect  responses, 
respectively.  This  is  exactly  the  opposite  of  the 
results  for  the  simple  RT  control.  There,  errors 
uere  consistently  faster  than  correct  responses. 

2.  Mean  natch  time  decreased  linearly  over  the  three 
groups,  both  for  correct  and  incorrect  responses. 
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Further,  incorrect  match  times  were  longer  than 
correct  times  in  all  three  groups.  These  data  are 
shown  in  Figure  3.9. 

3.  There  was  a  substantial  interaction  between 

correctness  of  the  test  probe  and  correctness  of 
the  response.  These  data  are  plotted  in  Figure 

e 

3.10a.  "Yes"  responses  were  faster  on  correct 
items,  and  "no”  responses  were  faster  on  incorrect 
items.  These  differences  were  strongest  in  the  low 
and  medium  groups. 

*4.  Cell  means  for  the  rotation  facet  are  plotted  in 

Figure  3.10b.  All  three  groups  showed  an  increase 
in  natch  time  on  rotation  items  for  incorrect 
responses.  However,  only  the  low  group  showed  a 
consistent  increase  for  correct  responses.  In 
fact,  both  correct  and  incorrect  match  times 
increased  linearly  over  levels  of  rotation  in  the 
low  group.  In  the  high  group,  on  the  other  hand, 
correct  match  times  were  consistently  rapid, 

5.  For  construction,  however,  match  time  for  all  three 
groups  were  greater  for  construction  than  no 
construction  items.  As  shown  in  Figure  3.10c.  the 
most  pronounced  difference  between  groups  was  the 
magnitude  of  the  increase  between  no  construction 
and  one  piece  additions. 

6.  Match  times  for  correct  responses  increased 
linearly  over  both  the  stimulus  complexity  and  type 
of  addition  facets.  However,  the  increases  were 
all  less  than  one  second. 

7.  Product  complexity  had  large  and  consistent  effects 
for  all  three  groups.  These  means  are  plotted  in 
Figure  3.10d. 

Correlations  . 

Match  time  was  the  only  component  time  that  correlated 
significantly  with  spatial  tests.  Separate  correlations  with  the 
reference  tests  were  computed  for  mean  match  time  over  all  £16 
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items  and  mean  match  time  over  the  24  no  construction,  no 
rotation  items.  Four  correlations  were  significant  in  one  or 
both  analyses.  These  correlations  are  shoun  in  Table  3. 18.  The 
surprize  here  uas  that  match  time  in  the  no  construction,  no 
rotation  cells  correlated  with  the  reference  spatial  tests  as 
uell  as  overall  match  time.  The  mean  error  rate  for  the  24  no 
construction,  no  rotation  items  uas  only  4.4  percent,  or  one 
item.  Thus,  the  correlation  is  not  an  artifact  of  the 
differential  match  times  associated  with  correctness.  Further, 
neither  mean  encoding  time,  mean  total  time  (encoding  time  plus 
match  time)  nor  total  correct  over  these  24  items  correlated 
significantly  uith  any  of  the  reference  tests. 

It  uas  hypothesized  that  match  times  or  total  time  on  these 
simple  no  construction,  no  rotation  items  might  be  related  to  one 
or  more  of  the  perceptual  speed  tests,  particularly  Identical 
Pictures.  Houever,  mean  match  time  on  these  items  correlated 
only  -.33,  and  total  time  -.02  uith  Identical  Pictures. 

Summary  2_t  Watch  Component  Analyses. 

Those  uho  did  not  perform  uell  on  this  task  shoued  large 
increases  in  correct  match  times  betueen  no  construction  and 
construction. items,  and  a  linear  increase  in  correct  match  time 
over  the  rotation  facet.  Those  uho  performed  uell  shoued  very 
little  increase  over  the  construction  facet  and  no  increase  over 
rotation  in  correct  match  time.  This  could  indicate  that  lou 
ability  subjects  had  not  completed  the  construction  or  rotation 
operations  before  advancing  to  the  match  frame. 

Tuo  pieces  of  evidence  argue  against  this  interpretation. 
First,  although  construction  times  increased  dramatically  betueen 
one  and  tuo  piece  additions,  match  times  did  not.  In  fact,  in 
the  lou  performance  group,  mean  correct  match  time  uas  the  same 
at  levels  tuo  and  three  of  the  construction  facet.  On  the  other 
hand,  mean  correct  match  times  increased  linearly  over  the 
rotation  facet  uithin  this  lou  performance  group.  If  these 
subjects  uere  advancing  to  the  next  frame  before  completing  the 
requested  construction  or  rotation  operation,  then  they  did  so 
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Table  3.18 


Correlations  between 

Mean  Match  Times  and 

Reference  Tests  (N»30) 

Reference  Test 

216  Items 

24  Items 

Form  Board 

-40 

-44 

Surface  Development 

-47 

-39 

Memory  for  Designs 

-41 

-48 

F igures 

-46 

-4  3 

Note.  Decimals  omitted. 
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differentially  for  construction  and  rotation. 

Second,  the  highest  correlation  between  mean  match  time  over 
the  24  no  construction,  no  rotation  items  and  reference  tests  was 
-.48  with  Memory  for  Designs.  The  crucial  requirements  of  this 
test  are  to  remember  and  reproduce  the  design  in  its  correct 
proportions.  Those  who  have  difficulty  retaining  a  given  design 
in  its  correct  proportions  should  have  even  greater  difficulty 
keeping  a  constructed  or  transformed  (i.e.,  rotated)  figure  in 
its  correct  proportions.  The  comparison  and  decision  processes 
should  take  longer  for  these  distorted  images  than  for  the 
relatively  clear  images  where  no  construction  or  rotation  was 
involved . 

Smith  (1964)  has  long  argued  that  the  essence  of  spatial 
ability  is  the  ability  to  retain  a  stimulus  in  its  correct 
proportions.  The  individual  differences  in  match  times  suggest 
that  this  may  indeed  be  an  important  aspect  of  spatial  ability. 

It  must  be  emphasized,  however,  that  the  spatial  skills 
tapped  by  the  match  times  were  largely  independent  of  the  level 
scores  on  this  task.  The  correlation  betueen  mean  match  time 
over  the  24  items  and  total  SCORE  was  only  -.24.  Thus,  while 
individual  differences  in  match  times  were  related  to  spatial 
ability,  the  basic  hypothesis  of  speed-level  independence  remains 
uncha 1 1  eng  ed . 

V_L.  Comoa  r  i  s  on  2_i  Qvera  1 1  and  Component  Anal  yses 
This  section  compares  and  summarizes  the  major  results  of 
the  total  time  and  correctness,  and  component  time  and 
correctness  analyses. 

I.aJLai.  Scores 

Individual  differences  in  total  correct  (by  SC0RE3)  were 
strongly  correlated  with  spatial  ability.  Total  time,  on  the 
other  hand,  was  independent  of  spatial  ability  and  weakly  related 
to  verbal  ability. 

Generally  fast  accurate  performance  on  this  task  was 
correlated  with  only  one  reference  test,  UAIS  Picture 
Arrangement.  The  tendency  to  opt  for  accuracy  instead  of  speed 
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was  related  to  general  ability. 


Susad-ina 

Encoding  component  time  decreased  linearly  over  stimulus 
complexity  and  the  four  blocks  of  test  trials.  Individual 
differences  in  mean  encoding  time  uere  highly  correlated  uith  the 
intercept  parameters  from  the  regressions  of  total  and  correct 
times  on  percent  failing  each  item.  Further,  individual 
differences  in  mean  encoding  time  uere  independent  of  the  slope, 
predicted  times  at  maximum  difficulty,  and  all  of  the  reference 
tests,  except  number  of  pencil  marks  made  on  the  Form  Board  test. 

CLaa.lx  « l  gju.rAU.Ua.ns:  • 

Relatively  large  increases  in  encoding  time  over  the  stimulus 
complexity  facet,  especially  between  medium  and  high  complexity, 
uere  associated  uith  verbal  ability  and  memory  span.  It  was 
suggested  that  the  reported  strategies  of  attempting  to  remember 
a  complex  initial  stimulus  by  vieuing  it  as  a  combination  of 
simpler  forms,  or  attempting  to  associate  the  stimulus  with  a 
concrete  image  would  result  in  relatively  long  encoding  times  for 
high  complexity  stimuli. 

Change  in  encoding  time  over  blocks  was  not  related  to  the 
reference  tests  or  overall  performance  on  the  experimental  task. 

Construction 

tle.ans  A.oaI.V-1  is  • 

Tuo  piece  additions  took  much  longer  than  one  piece 
additions,  but  did  not  result  in  more  errors.  On  all  other 
facets  related  to  construction,  error  rate  and  total  or  component 
time  increased  together.  The  most  difficult  constructions  uere 
left  additions  of  complex  stimuli  that  combined  to  form  high 
complexity  products.  Left  additions  were  more  difficult  than 
right  additions  only  uhen  tuo  pieces  had  to  be  added  to  the  base. 
Left  additions  that  resulted  in  high  complexity  products  uere 
more  difficult  than  corresponding  right  additions.  Difficulty 
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increased  nonlinearly  over  stimulus  complexity,  with  the 
difference  between  high  and  medium  complexity  larger  than  the 
corresponding  difference  between  medium  and  lou  complexity 
stimuli.  The  largest  changes  in  difficulty  were  captured  by  the 
product  complexity  facet,  with  high  complexity  products  much  more 
difficult  to  construct  than  lou  complexity  products. 

CgllJ;  ra?  t  Correlations  . 

M.  Contrasts  and  Cell  Totals .  Mean  construction  component 
time  cqrrelated  significantly  uith  six  verbal  tests.  Mean 
construction  time  on  easy  items  uas  independent  of  the  reference 
tests,  while  mean  time  on  difficult  items  uas  highly  correlated 
uith  verbal  ability.  Total  SCORE 3 ,  on  the  other  hand,  uas 
strongly  related  to  performance  on  spatial  tests,  both  for  easy 
and  difficult  items. 

Construction  (£,  J£)  .  In  both  the  total  time  and  construction 
component  analyses,  steep  slopes  were  related  to  verbal  ability. 
Further,  the  quadratic  contrast  in  the  total  time  analysis 
correlated  positively  uith  reported  vividness  of  visual  imagery 
and  negatively  uith  Hidden  Figures.  The  linear  SC0RE3 
construction  contrast  scores  correlated  positively  uith  Form 
Board  and  Object  Assembly,  while  the  quadratic  contrast  scores 
had  equally  large,  but  negative  correlations  with  these  two 
tests.  Thus,  high  scores  on  the  Form  Board  and  Object  Assembly 
reference  tests  were  related  to  a  linear  decline  in  performance 
over  the  three  levels  of  the  construction  facet. 

For  the  combined  latency  and  correctness  indices,  larger 
increases  in  difficulty  between  one  and  two  piece  additions  than 
between  zero  and  one  piece  additions  uere,  once  again,  related  to 
Form  Board  and  Object  Assembly.  Houever,  magnitude  of  the  simple 
difference  betueen  difficulty  of  one  and  tuo  piece  additions  uas 
related  to  verbal  ability. 

Shifting  to  slower,  more  accurate  performance  betueen  levels 
one  and  three  uas  associated  uith  verbal  ability,  uhile 
relatively  larger  shifts  betueen  one  and  tuo  piece  additions  than 
betueen  zero  and  one  piece  additions  uere  negatively  related  to 
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spatial  ability.  Thus,  high  spatial  ability  subjects  shoued  the 
greatest  change  in  speed-accuracy  tradeoff  betueen  no  addition 
and  one  addition  items,  rather  than  betueen  one  and  tuo  piece 
add i t ions . 

Stimulus  Complexity.  In  both  the  total  time  and  construction 
component  analyses,  the  linear  stimulus  complexity  contrast  uas 
related  to  verbal  ability. 

The  quadratic  contrast  uas  strongly  related  to  verbal,  verbal 
reasoning,  and  perceptual  speed  tests  in  the  total  time  analysis, 
but  more  related  to  general  ability  in  the  construction  component 
time  analysis. 

Both  linear  and  quadratic  SC0RE3  stimulus  complexity 
constrasts  uere  independent  of  the  reference  tests  in  the  overall 
analysis,  but  ueakly  related  to  Copying  and  Figures  in  the 
analysis  of  the  144  construction  items. 

For  the  composite  difficulty  index,  those  uho  took  much 
longer  on  constructions  involving  high  complexity  stimuli  than  on 
those  using  lou  complexity  stimuli  scored  higher  on  a  cluster  of 
numerical  or  reasoning  tests.  A  relatively  larger  increase  in 
difficulty  betueen  medium  and  high  complexity  than  betueen  lou 
and  medium  complexity  uas  related  to  both  spatial  ability  and 
memory  span. 

Strong  shifts  to  slouer,  more  accurate  responding  betueen  lou 
and  high  stimulus  complexity  additions  uas  associated  uith  high 
scores  on  uord  fluency  tests. 

Type  Addition.  In  both  total  time  and  construction 
component  analyses,  taking  longer  on  left  additions  than  right 
additions  uas  related  to  good  performance  on  Object  Assembly  and 
verbal  tests.  Further,  those  uho  missed  many  more  left  additions 
than  right  additions  had  high  scores  on  the  Visual  Number  Span 
test,  while  those  uhose  performance  uas  about  the  same  on  both 
reported  strong  visual  imagery. 

For  the  combined  latency  and  correctness  indices,  more  errors 
and  longer  latencies  on  left  additions  uere  even  more  strongly 
associated  uith  high  scores  on  Visual  Number  Span  and  lou  scores 
on  the  Marks  Vividness  of  Visual  Imagery  Qu es t i ona i re .  Switching 


to  slower,  more  accurate  responding  on  left  additions  was 
indicative  of  high  scores  on  form  Board  and  Object  Assembly. 

Product  Complexity.  In  both  the  total  time  and  construction 
component  time  analyses,  taking  longer  on  high  product  complexity 
additions  was  strongly  related  to  general  ability.  Time  taken  on 
lou  product  complexity  additions  was  also  negatively  correlated 
uith  Closure  Speed.  On  the  other  hand,  making  relatively  more 
errors  on  high  complexity  additions  was  weakly  related  to  verbal 
ability. 

The  combined  latency  and  correctness  difficulty  indices  were 
also  strongly  related  to  general  ability  in  both  analyses.  The 
magnitude  of  the  speed-accuracy  tradeoff,  on  the  other  hand,  was 
uncorrelated  uith  the  reference  tests  and  only  weakly  related  to 
an  "internal”  variable,  total  number  of  correct  draw  trials. 

j»t  i  pn 

tjgJLD?.  Anal  vs  is  . 

Total  time  increased  nonlinearly  while  correctness  decreased 
linearly  over  the  rotation  facet.  The  combined  difficulty  index, 
however,  showed  a  perfectly  linear  increase.  Further,  there  was 
a  general  shift  to  slower,  more  careful  responding  when  rotation  - 
was  req u i r ed  . 

Both  rotation  time  and  errors  decreased  linearly  over  the 
four  blocks.  The  decrease  was  largest  for  errors  on  180  degree 
rotations . 

Although  the  interaction  between  stimulus  complexity  and 
rotation  was  significant  in  both  the  component  time  and  SC0RE3 
analyses,  the  interaction  was  different  in  each  block  and 
significant  only  in  the  first  two  blocks. 

Rotation  component  time  increased  linearly  over  the 
construction  facet.  However,  interpretation  is  difficult  since 
construction  always  preceded  rotation. 

&aal E.a.s.t  Correlations  . 

H  Contrast .  Mean  rotation  time  correlated  significantly  with 
number  of  errors  on  the  Cards  test.  When  analyzed  by  block,  the 
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correlation  uas  significant  only  in  the  first  block.  Total 
number  correct  on  rotation  only  items  correlated  significantly 
uith  spatial  ability,  particularly  in  blocks  three  and  four.  The 
highest  correlations  in  block  four  were  with  Figures.  Cards,  and 
Paper  Folding.  Number  of  correct  180  degree  rotations  correlated 
slightly  higher  with  spatial  reference  tests  than  number  of 
correct  90  degree  rotations. 

Rotation.  Steep  slopes  were  associated  uith  Cards  Wrong  in 
the  total  time  analysis  ( R  ■  0,  90,  180).  and  verbal  ability  and 

Digit  Span  in  the  component  time  analysis  (R  =  90,  180).  A 

linear  decline  in  correctness  uas  related  to  high  scores  on 
Figures  and  Cards  in  the  overall  analysis  (R  *  0.  90.  180).  and 

high  scores  on  tuo  visual  memory  tests  in  the  analysis  of  the  48 
rotation  only  items  (R  =  90,  180). 

Stimulus  Complexity .  A  linear  increase  in  rotation  time  and 
a  linear  decrease  in  correctness  over  stimulus  complexity  uere 
related  to  high  scores  on  memory  span  tests,  particularly  WAIS 
Digit  Span  Backwards .  Further,  those  uho  made  relatively  more 
errors  on  rotations  of  high  complexity  stimuli  performed  uell  on 
Form  Board.  Object  Assembly,  and  Memory  for  Designs. 

tla.ts.ti 

naa_T5  Analysis. 

Match  uas  the  only  nonsignificant  main  effect  in  the  analysis 
of  total  time.  This  uas  the  result  of  a  substantial  interaction 
betueen  correctness  of  the  test  probe  and  correctness  of  the 
response.  In  particular,  "yes"  responses  uere  faster  on  correct 
items  and  "no"  responses  faster  on  incorrect  items.  Those  uho 
performed  uell  on  the  experimental  task  shoued  no  increase  in 
match  time  over  the  rotation  facet  and  a  small  increase  betueen 
construction  and  no  construction  items.  Those  uho  performed 
poorly  shoued  a  linear  increase  in  match  time  over  the  rotation 
facet  and  a  large  increase  in  match  time  on  construction  items. 
Both  correct  and  incorrect  match  times  uere  consistently  longer 
for  high  than  for  lou  product  complexity  stimuli. 


For  the  match  facet  in  the  total  time  analysis,  faster 


responses  to  positive  test  stimuli  were  related  to  high  scores  on 
the  Form  Board  and  Picture  Completion  tests.  Faster  responses  to 
negative  test  stimuli  were  associated  with  more  errors  on  the 
Cards  test  and  higher  scores  on  Visual  Number  Span. 

Mean  natch  component  time  was  the  only  component  time  that 
correlated  significantly  with  spatial  ability.  These 
correlations  were  the  same  for  all  216  items  and  the  2b  no 
construction,  no  rotation  items.  However,  mean  m.itch  time  was 
not  significantly  correlated  with  level  scores  on  the 
experimental  task. 


y LI. ■  S-gnejial  Discussion 

The  results  of  this  experiment  have  several  implications  for 
research  on  aptitude  processes. 

The  first  important  finding  was  that  individual  differences 
in  speed  of  solving  simple  items  were  independent  of  level  scores 
on  the  experimental  task  and  reference  tests.  Thus,  it  is 
unlikely  that  a  process  understanding  of  spatial  ability  will  be 
obtained  by  modeling  individual  differences  in  speed  of  solving 
simple  spatial  tasks. 

The  second  important  finding  was  that  individual  differences 
in  the  speed  of  solving  complex  tasks  were  consistently  related 
to  verbal  reference  tests.  The  implication  here  is  that  at  least 
some  subjects  solved  complex  items  using  ve r ba 1 -ana  1 y t i c 
strategies.  This  does  not  mean  that  all  subjects  used  nonspatial 
strategies  on  complex  items,  or  that  individual  differences  in 
the  speed  of  solving  complex  items  are  generally  unrelated  to 
spatial  ability.  The  most  likely  possibility  is  that  some 
subjects  solve  complex  items  on  spatial  tests  using  spatial 
strategies,  others  use  ve r ba 1 -ana  1 y t i c  strategies,  while  most  use 
both  spatial  and  verbal  processes.  The  high  positive 
correlations  between  solution  latency  for  complex  items  and 
verbal  reference  tests  probably  result  from  greater  variance  in 


solution  latency  for  those  itens  solved  by  verbal-analytic 
strategies.  This  greater  variance  in  latency  in  the  nonspatial 
group  would  completely  overshadow  the  much  smaller  variation  in 
latency  scores  in  the  spatial  group.  Correlations  for  the  t  o  t  a  ’ 
group  would  then  tend  to  reflect  the  correiatic  in  the  verbal 
and  nixed  strategy  groups. 

The  third  important  result  was  that  level  scores  on  this  task 
were  highly  correlated  with  scores  on  a  number  of  complex  spatial 
tests  in  the  reference  battery.  Thus,  the  findings  are  not 
limited  to  this  particular  task.  This  suggests  that  these 
reference  spatial  tests  also  can  be  solved  in  more  than  one  way. 
Those  who  have  studied  retrospective  and  introspective  reports  of 
solution  strategy  on  spatial  tests  have  reached  a  similar 
conclusion  (see  Lohnan.  1979a  for  a  review).  In  short,  the  major 
challenge  for  any  spatial  test,  whether  simple  or  complex,  is  to 
keep  it  .spatial. 

There  are  two  additional  implications  of  the  "nonspatial" 
nature  of  most, spatial  tests.  First,  verbal  and  spatial 
abilities  may  be  more  independent  than  the  literature  on  mental 
tests  suggests.  Second,  if  some  subjects  solve  items  on  a 
carefully  constructed  spatial  test  or  experimental  task  using 
nonspatial  strategies,  then  perhaps  spatial  ability  is  not  that 
important  in  the  real  world  (c.f.  Cronbach.  1970),  except,  of 
course.  wh?n  solution  time  is  a  crucial  factor  (e.g.,  piloting  an 
airplane  (Guilford  C  Lacey.  1947)).  It  is  also  possible  that 
spatial  ability  tests  would  have  higher  validity  coefficients  if 
they  could  be  made  truly  spatial.  One  simple  way  to  do  this 
would  be  to  utilize  both  error  and  latency  data,  as  in  the  SCORE3 
index  used  in  this  study. 

If  some  subjects  solved  some  items  on  this  experimental  task 
using  nonspatial  strategies,  then  the  obvious  next  question  is 
who  was  using  these  nonspatial  strategies,  and  on  what  items? 
Unfortunately,  the  experimental  task  was  not  designed  to  answer 
these  questions.  In  the  present  experiment,  the  predicted 
patterns  for  latency  and  error  data  over  the  design  facets  are 
the  same  for  both  spatial  and  v e r ba 1 -ana  1 y t i c  solution 
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strategies.  What  is  needed  is  an  experimental  task  for  which 
different  solution  strateagies  would  yield  qualitatively 
different  patterns  of  scores  over  specified  design  facets. 

The  most  likely  candidates  for  this  type  of  experiment  are 
the  first  and  last  component  processes;  namely  encoding  and 
discriminative  response  (or  match).  The  encoding  phase  is 
important  because  the  nature  of  the  internal  representation  of 
the  stimuli  may  determine,  predispose,  or  at  least  limit  the  type 
of  transformational  processes  that  may  operate  on  it.  Thus,  the 
subject  who  "burns  in"  a  holistic  representation  is  more  likely 
to  rotate  the  entire  stimulus  than  one  who  encodes  the  same 
stimulus  as  "a  rectangle  with  a  small  triangle  cut  out  of  the 
right  side." 

At  the  other  end  of  the  item,  individual  differences  in 
component  latencies  for  the  discriminative  response  were  the  only 
latency  measures  that  correlated  with  spatial  ability.  Further, 
scoring  items  with  abnormally  long  discriminative  response  times 
incorrect  was  the  major  factor  in  making  SC0P.E3  a  "cleaner" 
measure  of  spatial  ability  than  total  number  correct  (SCORE). 
Cooper  (1976)  has  also  found  important  individual  differences  in 
discriminative  response  time  on  spatial  tasks. 

For  both  encoding  and  discriminative  response  component 
times,  the  crucial  factor  appears  to  be  the  nature  of  the 
internal  representation  the  subject  is  either  constructing  or 
comparing  with  the  test  stimulus.  Subjects  who  attempted  to 
decompose  complex  stimuli  into  more  familiar,  labelable  pieces 
took  longer  to  encode,  add,  rotate,  and  compare  the  result  with  a 
test  s  t  imulus  . 

Another  result  that  underscores  the  importance  of  the 
internal  representation  was  that  individual  scores  for  the 
product  complexity  contrast  were  the  best  predictors  of  level 
scores  on  the  experimental  task.  The  interesting  aspect  of  this 
facet  is  that  it  is  a  characteristic  of  the  internal 
representation  subjects  are  assumed  to  construct,  and  is 
independent  of  the  complexity  of  the  stimuli  the  subject  actually 
sees.  Again,  however,  subjects  using  both  verbal-analytic  and 
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spatial  stratgies  should  show  effects  in  the  same  direction  over 
the  facet . 

This  study  has  also  pointed  to  some  important  methodological 
considerations  for  research  on  aptitude  processes.  First,  it 
underscores  the  important  differences  between  individual 
differences  in  speed  and  level.  Inferences  about  the  nature  of 
general  aptitude  constructs  based  on  experiments  that  model 
latency  for  relatively  error  free  tasks  are  suspect. 

Second,  the  method  of  computing  sum  and  difference  scores  for 
the  joint  analyses  of  error  and  latency  data,  while  superior  to 
other  methods  that  have  been  proposed  (Pachella,  1974;  Sternberg, 
1977),  is  still  inadequate.  When  a  subject  fails  to  solve  an 
item,  the  reaction  time  for  that  item  is  ambiguous.  Statistical 
manipulations  cannot  erase  the  psychological  ambiguity  of  the 
data.  A  retreat  to  simple  tasks  and  error  free  performance  is 
not  the  answer,  as  this  study  has  amply  demonstrated.  The 
important  methodological  realization  is  that  models  that  account 
for  performance  on  complex  items  are  necessarily  models  for  high 
ability  subjects.  Ability  is  confounded  with  item  complexity 
because  it  is  defined  by  the  maximum  difficulty  of  the  items  the 
subject  can  solve.  Experimentally,  then,  a  tailored  testing 
approach  should  prove  more  useful  than  the  typical  procedure 
where  all  subjects  attempt  all  items.  The  motivation  of  the  low 
ability  subjects  should  also  improve  since  they  would  not  be 
continually  confronted  with  items  that  they  cannot  solve. 
Rectangular  rather  than  representative  sampling  of  subjects  could 
then  be  used  to  insure  the  same  number  of  high,  medium,  and  low 
ability  subjects  in  the  design.  Low  ability  subjects  would 
provide  unambiguous  latency  data  for  easy  items,  medium  ability 
subjects  for  easy  and  medium  difficulty  items,  and  high  ability 
subjects  for  easy,  medium,  and  difficult  items.  The  analysis 
could  then  contrast  the  way  low,  medium,  and  high  ability 
subjects  solve  easy  items;  the  way  medium  and  high  ability 
subjects  solve  medium  difficulty  items  and  the  way  high  ability 
subjects  solve  difficult  items.  Contrasts  within  an  ability 
group  could  also  be  made,  such  as  between  the  models  that  best 
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fit  the  performance  of  high  ability  subjects  on  easy,  medium,  and 
difficult  i terns . 

Another  implication  for  future  research  stems  from  the 
conclusion  that  subjects  solve  different  items  in  different  ways. 
The  analysis  must  go  beyond  fitting  one  model  for  each  subject. 

It  must  focus  on  the  way  a  particular  subject  solves  a  particular 
item  type.  Level  of  practice,  item  difficulty,  and 
susceptibility  of  the  item  to  alternative  solution  strategies  are 
three  possible  blocking  factors  for  such  a  design. 
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SHA.P.IILR  IV 

SUMMARY 


ch  on  the  nature  of  aptitude  processes  has 
ignored  the  important  differences  between  speed  and  level  of 
performance.  However,  investigations  of  the  relationship  between 
individual  differences  in  speed  and  level  suggest  that  speed  of 
solving  simple  spatial  tasks  is  largely  independent  of  level 
attained  on  more  complex  spatial  tasks. 

The  present  experiment  was  designed  to  investigate  the 
relationships  between  speed,  level,  and  complexity  in  individual 
differences  in  spatial  ability.  Thirty  male  high  school  and 
college  students  were  selected  to  represent  a  wide  range  of 
individual  differences  in  verbal  ability  and  spatial  ability. 
Subjects  were  required  to  match,  rotate,  combine,  and  combine  and 
then  rotate  three  to  eight  point  polygons.  Three  levels  of 
construction  (one.  two,  or  three  stimuli),  three  levels  of 
stimulus  complexity  (low.  medium,  or  high),  three  levels  of 
rotation  (0.  90,  or  180  degrees),  and  two  types  of  discriminative 

response  (correct  or  incorrect)  were  fully  crossed  in  the  design. 
Additionally,  type  of  addition  (left  or  right)  and  complexity  of 
the  to-be-constructed  stimuli  were  nested  within  levels  of  the 
construction  facet  but  crossed  with  each  other  and  all  other 
facets.  Since  each  subject  was  administered  all  216  items, 
subjects  were  the  seventh  fully  crossed  factor. 

Thirty  trials  on  which  the  subject  was  asked  to  draw  the 
final  stimulus  rather  than  merely  to  recognize  it  were  randomly 
interspersed  with  the  test  trials.  A  simple  reaction  time 
control  condition  preceded  the  experiment. 

Correctness,  confidence,  and  from  two  to  four  component  times 
were  obtained  on  each  item.  Total  time  was  determined  by  summing 
the  component  times.  These  data  were  analyzed  using  regression, 
analysis  of  variance,  and  correlational  techniques. 

In  the  regression  analyses,  latency  measures  were  regressed 
on  a  group  difficulty  scale.  Correlations  between  intercepts, 
slopes,  predicted  times  at  maximum  difficulty,  and  level  scores 
and  reference  constructs  supported  the  basic  hypothesis  of  speed- 
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level  independence.  In  particular,  individual  differences  m 
speed  of  solving  simple  items  were  independent  of  both  level  on 
the  task  and  reference  abilities.  Speed  of  solving  complex  items 
was  strongly  correlated  with  level  on  the  task,  but  more  highly 
correlated  with  verbal  than  spatial  reference  constructs. 

In  the  means  analysis,  repeated  measures  analyses  of  variance 
were  performed  on  total  time  and  correctness,  component  times  and 
correctness,  and  combined  indices  representing  the  sum  and 
difference  of  independently  standardized  latency  and  correctness 
scores.  These  combined  analyses  were  performed  on  both  total 
time  and  correctness  as  well  as  each  component  time  and  its 
correctness  index. 

Relationships  between  individual  differences  in  total  scores, 
cell  totals,  and  the  pattern  of  cell  means  over  design  facets 
were  explored  in  the  correlational  analyses.  In  general, 
correlations  between  correctness  indices  and  reference  tests 
supported  the  construct  validity  of  the  task  and  its  facets, 
while  correlations  between  latency  indices  and  reference  tests 
suggested  more  about  the  processes  or  strategies  subjects  were 
using  to  solve  the  items. 

The  major  results  of  the  means  analyses  and  correlational 
analyses  are  summarized  below. 

I-g.l.al  S.SLgxa£ 

Individual  differences  in  total  correct  (by  SC0RE3)  were 
strongly  correlated  with  spatial  ability.  Total  time,  on  the 
other  hand,  was  independent  of  spatial  ability  and  weakly  related 
to  verbal  ability. 

Generally  fast  accurate  performance  on  this  task  was 
correlated  with  only  one  reference  test.  MAIS  Picture 
Arrangement.  The  tendency  to  opt  for  accuracy  instead  of  speed 
was  related  to  general  ability. 

Sjiggdinq. 

Hean*  Analysis . 


Encoding  component  time  decreased  linearly  over  stimulus 


complexity  and  the  four  blocks  of  test  trials.  Individual 
differences  in  mean  encoding  time  were  highly  correlated  uith  the 
intercept  parameters  from  the  regressions  of  total  and  correct 
times  on  percent  failing  each  item.  Further,  individual 
differences  in  mean  encoding  time  were  independent  of  the  slope, 
predicted  times  at  maximum  difficulty,  and  all  of  the  reference 
tests,  except  number  of  pencil  marks  made  on  the  Form  Board  test. 


g.an.l.ra* L  garreUt  ion*. 

Relatively  large  increases  in  encoding  time  over  the  stimulus 
complexity  facet,  especially  between  medium  and  high  complexity, 
were  associated  uith  verbal  ability  and  memory  span.  It  was 
suggested  that  the  reported  strategies  of  attempting  to  remember 
a  complex  initial  stimulus  by  viewing  it  as  a  combination  of 
simpler  forms,  or  attempting  to  associate  the  stimulus  uith  a 
concrete  image  would  result  in  relatively  long  encoding  times  for 
high  complexity  stimuli. 

Change  in  encoding  time  over  blocks  was  not  related  to  the 
reference  tests  or  overall  performance  on  the  experimental  task. 

Construction 

tie, an*  Analysis. 

Two  piece  additions  took  much  longer  than  one  piece 
additions,  but  did  not  result  in  more  errors.  On  all  other 
facets  related  to  construction,  error  rate  and  total  or  component 
time  increased  together.  The  most  difficult  constructions  were 
left  additions  of  complex  stimuli  that  combined  to  form  high 
complexity  products.  Left  additions  were  more  difficult  than 
right  additions  only  when  two  pieces  had  to  be  added  to  the  base. 
Left  additions  that  resulted  in  high  complexity  products  were 
more  difficult  than  corresponding  right  additions.  Difficulty 
increased  nonlinearly  over  stimulus  complexity,  uith  the 
difference  between  high  and  medium  complexity  larger  than  the 
corresponding  difference  between  medium  and  low  complexity 
stimuli.  The  largest  changes  in  difficulty  were  captured  by  the 
product  complexity  facet,  uith  high  complexity  products  much  more 
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difficult  to  construct  than  lou  complexity  products. 


gpnt.rqs t  £g c nala Li  <?  n  ?.  • 

H  Contrasts  and  Cell  Totals .  Mean  construction  component 
time  correlated  significantly  uith  six  verbal  tests.  Mean 
construction  time  on  easy  items  uas  independent  of  the  reference 
tests,  uhile  mean  time  on  difficult  items  uas  highly  correlated 
uith  verbal  ability.  Total  SC0RE3.  on  the  other  hand,  uas 
strongly  related  to  performance  on  spatial  tests,  both  for  easy 
and  difficult  items. 

Construction  (£.  &) .  In  both  the  total  time  and  construction 
component  analyses,  steep  slopes  uere  related  to  verbal  ability. 
Further,  the  quadratic  contrast  in  the  total  time  analysis 
correlated  positively  uith  reported  vividness  of  visual  imagery 
and  negatively  uith  Hidden  Figures.  The  linear  SCORE 3 
construction  contrast  scores  correlated  positively  uith  Form 
Board  and  Object  Assembly,  uhile  the  quadratic  contrast  scores 
had  equally  large,  but  negative  correlations  uith  these  tuo 
tests.  Thus,  high  scores  on  the  Form  Board  and  Object  Assembly 
reference  tests  uere  related  to  a  linear  decline  in  performance 
over  the  three  levels  of  the  construction  facet. 

For  the  combined  latency  and  correctness  indices,  larger 
increases  in  difficulty  betueen  one  and  tuo  piece  additions  than 
betueen  zero  and  one  piece  additions  uere.  once  again,  related  to 
Form  Board  and  Object  Assembly.  Houever,  magnitude  of  the  simple 
difference  betueen  difficulty  of  one  and  tuo  piece  additions  uas 
related  to  verbal  ability. 

Shifting  to  slouer.  more  accurate  performance  betueen  levels 
one  and  three  uas  associated  uith  verbal  ability,  uhile 
relatively  larger  shifts  betueen  one  and  tuo  piece  additions  than 
betueen  zero  and  one  piece  additions  uere  negatively  related  to 
spatial  ability.  Thus,  high  spatial  ability  subjects  shoued  the 
greatest  change  in  speed-accuracy  tradeoff  betueen  no  addition 
and  one  addition  items,  rather  than  betueen  one  and  tuo  piece 
additions. 

Stimulus  Complexity.  In  both  the  total  time  and  construction 


component  analyses,  the  linear  stimulus  complexity  contrast  uas 
related  to  verbal  ability. 

The  quadratic  contrast  uas  strongly  related  to  verbal,  verbal 
reasoning,  and  perceptual  speed  tests  in  the  total  time  analysis, 
but  more  related  to  general  ability  in  the  construction  component 
time  analysis. 

Both  linear  and  quadratic  SC0RE3  stimulus  complexity 
constrasts  uere  independent  of  the  reference  tests  in  the  overall 
analysis,  but  ueakly  related  to  Copying  and  Figures  in  the 
analysis  of  the  144  construction  items. 

For  the  composite  difficulty  index,  those  uho  took  much 
longer  on  constructions  involving  high  complexity  stimuli  than  on 
those  using  lou  complexity  stimuli  scored  higher  on  a  cluster  of 
numerical  or  reasoning  tests.  A  relatively  larger  increase  in 
difficulty  betueen  medium  and  high  complexity  than  betueen  lou 
and  medium  complexity  uas  related  to  both  spatial  ability  and 
memory  span. 

Strong  shifts  to  slower,  more  accurate  responding  betueen  lou 
and  high  stimulus  complexity  additions  uas  associated  uith  high 
scores  on  uord  fluency  tests. 

Type  flj.  Addition.  In  both  total  time  and  construction 
cdmponent  analyses,  taking  longer  on  left  additions  than  right 
additions  uas  related  to  good  performance  on  Object  Assembly  and 
verbal  tests.  Further,  those  uho  missed  many  more  left  additions 
than  right  additions  had  high  scores  on  the  Visual  Number  Span 
test,  uhile  those  uhose  performance  uas  about  the  same  on  both 
reported  strong  visual  imagery. 

For  the  combined  latency  and  correctness  indices,  more  errors 
and  longer  latencies  on  left  additions  uere  even  more  strongly 
associated  uith  high  scores  on  Visual  Number  Span  and  lou  scores 
on  the  Harks  Vividness  of  Visual  Imagery  Questionaire.  Suite hing 
to  slouer.  more  accurate  responding  on  left  additions  uas 
indicative  of  high  scores  on  Form  Board  and  Object  Assembly. 

Product  C  onp 1  ex i t  y .  In  both  the  total  time  and  construction 
component  time  analyses,  taking  longer  on  high  product  complexity 
additions  uas  strongly  related  to  general  ability.  Time  taken  on 


low  product  complexity  additions  uas  also  negatively  correlated 
with  Closure  Speed.  On  the  other  hand,  making  relatively  more 
errors  on  high  complexity  additions  uas  weakly  related  to  verbal 
\  ability. 

The  combined  latency  and  correctness  difficulty  indices  were 
also  strongly  related  to  general  ability  in  both  analyses.  The 
magnitude  of  the  s peed -accu rac y  tradeoff,  on  the  other  hand,  uas 
uncorrelated  with  the  reference  tests  and  only  weakly  related  to 
an  "internal"  variable,  total  number  of  correct  draw  trials. 


tat  ion 

tt£.an-a  Ana  lysis. 

Total  time  increased  nonlinearly  while  correctness  decreasea 
linearly  over  the  rotation  facet.  The  combined  difficulty  index, 
however,  showed  a  perfectly  linear  increase.  Further,  there  uas 
a  general  shift  to  slower,  more  careful  responding  when  rotation 
uas  r eq u i r ed  . 

Both  rotation  time  and  errors  decreased  linearly  over  the 
four  blocks.  The  decrease  uas  largest  for  errors  on  180  degree 
rotat i ons  . 

Although  the  interaction  between  stimulus  complexity  and 
rotation  uas  significant  in  both  the  component  time  and  SCORE 3 
analyses,  the  interaction  uas  different  in  each  block  and 
significant  only  in  the  first  two  blocks. 

Rotation  component  time  increased  linearly  over  the 
construction  facet.  However,  interpretation  uas  difficult  since 
construction  always  preceded  rotation. 

Correlations  . 

H  Contrast.  Mean  rotation  time  correlated  significantly  with 
number  of  errors  on  the  Cards  test.  When  analyzed  by  block,  the 
correlation  uas  significant  only  in  the  first  block.  Total 
number  correct  on  rotation  only  items  correlated  significantly 
with  spatial  ability,  particularly  in  blocks  three  and  four.  The 
highest  correlations  in  block  four  were  with  Figures,  Cards,  and 
Paper  Folding.  Number  of  correct  180  degree  rotations  correlated 
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slightly  higher  with  spatial  reference  tests  than  number  of 
correct  90  degree  rotations. 

Rotation.  Steep  slopes  were  associated  with  Cards  Wrong  :n 
the  total  time  analysis  (R  *  0.  90.  180).  and  verbal  ability  and 

Digit  Span  in  the  component  time  analysis  (R  =  90,  180).  A 

linear  decline  in  correctness  uas  related  to  high  scores  on 

Figures  and  Cards  in  the  overall  analysis  (R  =  0,  90.  180).  and 

high  scores  on  two  visual  memory  tests  in  the  analysis  of  the  48 
rotation  only  items  (R  *  90,  180). 

Stimulus  Complexity .  A  linear  increase  in  rotation  time  and 
a  linear  decrease  in  correctness  over  stimulus  complexity  were 
related  to  high  scores  on  memory  span  tests,  particularly  WAIS 
Digit  Span  Backwards .  Further,  those  who  made  relatively  more 
errors  on  rotations  of  high  complexity  stimuli  performed  uell  on 
Form  Board,  Object  Assembly,  and  Memory  for  Designs. 

ELaAsil 

titans  Anal  ys  is  . 

Match  uas  the  only  nonsignificant  main  effect  in  the  analysis 
of  total  time.  This  uas  the  result  of  a  substantial  interaction 
betueen  correctness  of  the  test  probe  and  correctness  of  the 
response.  In  particular,  "yes"  responses  uere  faster  on  correct 
items  and  "no"  responses  faster  on  incorrect  items.  Those  who 
performed  uell  on  the  experimental  task  shoued  no  increase  in 
match  time  over  the  rotation  facet  and  a  small  increase  betueen 
construction  and  no  construction  items.  Those  uho  performed 
poorly  shoued  a  linear  increase  in  match  time  over  the  rotation 
facet  and  a  large  increase  in  match  time  on  construction  items. 
Both  correct  and  incorrect  matcli  times  uere  consistently  longer 
for  high  than  for  lou  product  complexity  stimuli. 

CflmaUUops  • 

For  the  match  facet  in  the  total  time  analysis,  faster 
responses  to  positive  test  stimuli  uere  related  to  high  scores  on 
the  Form  Board  and  Picture  Completion  tests.  Faster  responses  to 
negative  test  stimuli  uere  associated  uith  more  errors  on  the 
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Cards  test  and  higher  scores  on  Visual  Number  Span. 

[lean  match  component  time  uas  the  only  component  time  that 
correlated  significantly  with  spatial  ability.  These 
correlations  were  the  same  for  all  216  items  and  the  24  no 
construction,  no  rotation  items.  However,  mean  match  time  uas 
not  significantly  correlated  with  level  scores  on  the 
experimental  task. 
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CHAPTER  v 

CONCLUSIONS 

1.  The  most  important  result  of  this  experiment  uas  the 
finding  that  individual  differences  in  speed  of  doing  simple 
spatial  tasks  are  unrelated  to  verbal  ability,  spatial  ability, 
or  level  scores  on  the  experimental  task.  Speed  of  solving 
complex  items  uas  highly  correlated  with  two  task  level  scores 
and  verbal  reference  tests. 

2.  In  general,  individual  differences  in  the  patterns  of 
correctness  cell  means  over  facets  supported  the  construct 
validity  of  the  facet,  uhile  individual  differences  in  the 
patterns  of  latency  cell  means  over  facets  suggested  more  about 
the  processes  or  strategies  subjects  were  using  to  solve  the 
items.  Correlations  betueen  individual  contrast  scores  and 
reference  tests  provided  a  powerful  method  for  examining  these 
relationships . 

3.  Large  interactions  betueen  facets  implied  that  a  simple 
additive  mathematical  model  or  process  model  that  postulated 
independent  stages  would  not  describe  these  data  well. 

Complexity  or  difficulty  increased  nonlinearly. 

4.  Since  tine  and  correctness  related  differently  to  external 
variables,  a  complete  analysis  must  examine  both.  A  method 
designed  to  analyse  their  joint  effects  uas  proposed.  Time  and 
correctness  were  first  independently  standardized  to  zero  mean 
and  unit  variance.  The  sum  and  difference  of  these  tuo  scores 
were  then  computed.  The  difference  score  represented  a  composite 
index  of  difficulty,  uhile  the  sum  score  reflected  the  unique 
contribution  of  each  variable,  or  variations  in  s peed -accu r ac y 
tradeoff.  Although  new  relationships  sometimes  emerged  uhen 
contrast  scores  for  these  variables  were  correlated  with 
reference  constructs,  it  uas  concluded  that  correlational 
analyses  were  best  performed  on  the  original  latency  and 
correctness  variables.  The  composite  variables  were,  however, 
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quite  appropriate  for  the  means  analysis. 


5.  The  relationships  between  overall  performance,  performance 
on  a  particular  item  type,  the  patterns  of  performance  over 
facets,  and  reference  tests  provided  insights  into  the  nature  of 
the  task  and  the  nature  of  the  reference  constructs.  Total 
number  correct  uas  strongly  correlated  with  a  cluster  of  complex 
spatial  tests  called  Visualization  in  the  factor  analytic 
literature.  Performance  on  the  construction  facet  uas 
consistently  related  to  Form  Board  and  Object  Assembly, 
suggesting  that  mental  construction  may  indeed  be  an  important 
spatial  skill.  Similarly,  performance  over  the  rotation  facet 
uas  related  to  performance  on  two  tests  (Cards  and  Figures)  that 
usually  define  the  factor  called  Spatial  Relations.  The  Closure 
Speed  factor  uas  related  to  time  spent  on  constructions  that 
yielded  a  lou  complexity  product.  Other  clusters  of  tests 
representing  such  diverse  factors  as  Reasoning  (or  Humber),  Word 
Fluency.  Visual  Memory,  and  Memory  Span  were  also  related  to 
particular  patterns  of  performance  on  the  task. 

6.  Several  important  aspects  of  the  spatial  construction  task 
were  identified.  In  particular,  mental  construction  uas 
differentially  affected  by  the  number  of  stimulus  elements  to  be 
combined,  the  complexity  of  these  stimuli,  the  location  (or  type) 
of  the  addition,  and,  most  significantly,  the  complexity  of  the 
to-be-constructed  stimulus.  Further,  large  increases  in  latency 
over  construction,  stimulus  complexity,  and  type  of  addition  were 
related  to  verbal  ability  and  verbal  solution  strategies.  Rapid 
construction  on  lou  product  complexity  items  uas  related  to 
Closure  Speed,  uhile  good  overall  performance  uas  related  to  Form 
Board  and  Object  Assembly.  Finally,  differential  correlations 
with  Hidden  Figures  and  Marks  Vividness  of  Visual  Imagery 
Questionaire  suggested  that  holistic  and  analytic  spatial 
construction  strategies  must  be  distinguished. 

7.  Analysis  of  th  rotation  items  confirmed  Cooper  and 
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Podgorny's  (1976)  finding  that  rate  of  rotation  uas  independent 
of  stimulus  complexity.  Examination  of  individual  differences 
added  that  steep  reaction  tine  slopes  for  rotation  were  related 
to  verbal  ability,  short  term  verbal  memory,  and  verbal 
strategies.  Further,  large  increases  in  rotation  time  for 
complex  stimuli  uere  also  associated  with  verbal  memory  span 
tests. 

8.  Some  subjects  attempted  to  solve  some  items  on  this  task 
using  nonspatial  strategies.  However,  since  total  SCORE’S  uere 
highly  correlated  with  performance  on  other  complex  spatial 
tests,  these  tests  are  also  suspect.  Those  uho  have  studied 
retrospective  reports  of  solution  strategies  on  spatial  tests 
have  reached  the  sane  conclusion  (see  tollman.  1979a,  for  a 
revieu).  The  problem  is  particularly  acute  on  complex  spatial 
tasks.  Sone  subjects  solve  easy  items  using  spatial  strategies, 
but  find  that  these  strategies  break  doun  as  iten  difficulty 
increases.  They  then  resort  to  v e r ba 1  - a n a  1 y t i c  strategies. 
Strategies  also  change  with  practice.  Attempting  to  eliminate 
individual  differences  in  solution  strategy  by  keeping  the  task 
simple  uill  not  do,  as  simple  tests  are  also  solved  in  different 
ways  by  different  subjects.  Further,  as  this  study  has  shown, 
individual  differences  in  speed  of  solving  simple  spatial  tasks 
are  largely  independent  of  more  general irable  level  scores  on 
complex  tasks  of  the  same  type.  Future  research  on  spatial 
ability  must  attend  to  these  important  intra-subject  and  inter¬ 
subject  differences  in  solution  strategy.  One  way  to  do  this  is 
through  experiments  that  are  designed  to  yield  qualitatively 
different  patterns  of  scores  for  different  solution  strategies. 
Proportional  subject  sampling,  selective  item  administration,  and 
modeling  performance  at  the  level  of  item  types  within 
individuals  are  also  recommended  as  part  of  the  proposed  research 
strategy. 
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